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REAL-TIME SOFTWARE RECEIVER 

CROSS REFERENCE TO RELATED 
APPLICATIONS 

The present application claims priority to U.S. Provisional 
Application No. 60/439,391 filed Jan. 10, 2003 entitled 
REAL-TIME SOFTWARE RECEIVER which is incorpo- 
rated herein in its entirety by reference. 

STATEMENT OF GOVERNMENT INTEREST 

This invention was made with United States Government 
support from the Office of Naval Research (ONR) under 
contract number N00014-02-J-1822 and from the National 
Aeronautics and Space Administration (NASA) under con- 
tract numbers NCC5-563, NAG5-11819, and NAG5-12089. 
The United States Government has certain rights in the 
invention. 

BACKGROUND OF THE INVENTION 

This invention relates generally to software radio receiv- 
ers, and more specifically to a software receiver for posi- 
tioning systems. 

A typical positioning system receiver, such as is used in 
the Global Positioning System (GPS), includes an antenna, 
a radio frequency (RF) section, a correlator, a signal tracking 
and demodulation component, and a component to compute 
the navigation solution. The antenna, which is possibly 
followed by a pre-amplifier, receives L-band GPS signals. 
The RF section filters and down converts the GHz GPS 
signal to an intermediate frequency in the MHz range. The 
RF section also digitizes the signal. The correlator separates 
the down-converted signal into different channels (ten or 
more in modern receivers) allocated to each satellite. For 
each satellite, the correlator mixes the Doppler-shifted inter- 
mediate frequency signal to baseband by correlating it with 
a local copy of the carrier replica signal and it distinguishes 
the particular satellite by correlating the signal with a 
pseudo-random number (PRN) code. Software routines 
cause the carrier replica and PRN replica signals to track the 
actual received signal, extract the navigation message, and 
compute the navigation solution. 

Baseband mixing is a multiplication of an input signal by 
a complex exponential where the frequency of the complex 
exponential approximately matches that of the input signal. 
The resultant signal is centered at baseband. A complex 
signal can be broken down into cosine and sine signal 
components, resulting in separate in-phase and quadrature 
components. The frequency of the baseband mixed signal 
must be controllable to within a few millihertz in the case of 
a phase-locked loop for use in a precision navigation system, 
and the baseband mixed signal must have a continuously 
varying phase. In a hardware correlator, local oscillators 
generate cosine and sine signal components that have pre- 
cise frequency control and a continuous phase. Generating 
cosine and sine signal components on the fly with the correct 
frequency and phase is too time consuming to be feasible for 
a software correlator. Instead, the software correlator gen- 
erates cosine and sine signal components on a grid of 
frequencies off-line. These signal components must be 
stored on a time grid of points sampled at the RF front-end 
sampling frequency, for example, at 5.714 MHz for one 
particular RF front-end hardware configuration, and the 
signals must last for a typical accumulation period, e.g., for 
a 0.001 second coarse/acquisition (C/A) PRN code period 
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when working with GPS LI civilian signals. It takes tens of 
gigabytes of memory or more in order to brute-force store all 
frequencies on a one mHz grid ranging from -10 KHz to 
+10 KHz, which is the needed frequency range when 
5 tracking GPS satellites from a terrestrial receiver, and addi- 
tional storage is required to store a grid of possible starting 
phases at each frequency point. 

PRN code mixing is a multiplication of a baseband mixed 
signal by a prompt +1/-1 PRN code or by a +2/0/-2 
to early-minus-late PRN code, where the code timing and 
frequency approximately match that of the input signal. The 
resultant signal is a constant in the case of prompt PRN code 
mixing, and an approximately linear function of the code 
timing error in the case of early-minus-late mixing. A 
15 receiver accumulates both of these correlation outputs. The 
magnitude of the prompt accumulation indicates signal 
strength and whether a signal has been detected, and its 
in-phase (real) and quadrature (imaginary) components are 
used to measure carrier phase and Doppler shift. The mag- 
20 nitude of the early-minus-late accumulation measures the 
code timing error; it will be zero when the timing error is 
zero. 

The code phase of the baseband mixing signal must be 
controllable to within a percent or less of a PRN code chip 
25 for use in a precision navigation system. In a hardware 
correlator, local oscillators generate the prompt and early- 
minus-late PRN code replicas. A software correlator can 
either compute and store PRN code replicas, or compute 
them in real-time. 

.50 The current Global Positioning System is slated to realize 
expanded capabilities that include new civilian codes on the 
L2 frequency, a new L5 frequency, and new codes (M-code, 
CL and CM codes) on the L2 frequency. Some of these 
upgrades are slated to start within one to three years. A 
35 hardware correlator requires hardware modifications in 
order to use these new signals. In the near term, a receiver 
designer will be faced with a complex trade-off in order to 
decide whether the extra complexity is worth the improved 
performance that will accrue only very slowly as new GPS 
40 satellites replace older models. One way to avoid the com- 
plex trade-off is to use a software receiver that can receive 
and process new signals without the need for a new corr- 
elator chip set. 

A software receiver is flexible because its software com- 
45 ponents can be easily modified. One application of a soft- 
ware receiver is to merge together numerous devices that use 
wireless digital communication protocols to form a single 
device. For example, a cell phone, GPS receiver, and Per- 
sonal Data Assistant (PDA) could become a single device 
50 that plays the role of all three. Another use of a software 
receiver is to shorten development and to-market times for 
new wireless devices. For example, as new frequencies and 
codes are added to GPS, a software receiver having a 
software correlator simply needs to be reprogrammed, while 
55 a hardware approach would require a brand new correlator 
chip design. New PRN codes can be used simply by making 
software changes. Thus, software receiver technology less- 
ens the risks involved for designers during the period of 
transition to the new signals. Furthermore, a software 
60 receiver could be reprogrammed to use the Galileo system 
(European GPS) or GLONASS (Russian GPS). 

In the recent past, GPS software receivers have been 
developed that either post-process stored signals or operate 
in real-time. Previous real-time software receivers function 
65 with a limited number of channels (4-6) or require high-end 
computer speeds or digital signal processor (DSP) chips 
such as are disclosed in Real-Time GPS Software Radio 
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Receiver, Akos et al„ ION NTM 2001, 22-24 Jan. 2001, 
Long Beach, Calif., pp. 809-816 (Akos 2001a), and Global 
Positioning System Software Receiver ( gpSrx ) Implementa- 
tion in Low Cost/Power Programmable Processors, Akos et 
al., ION GPS 2001, 11-14 Sep. 2001, Salt Lake City, Utah, 
pp. 2851-2858, both incorporated herein in their entireties 
by reference. 

Therefore, it is an object of the present invention to create 
a software receiver that operates in real-time and is not 
restricted to a severely limited number of channels or to a 
very fast processor. 

Another object of the present invention is to minimize the 
number of sine and cosine signal components that must be 
stored. 

A further object of the present invention is to process 
incoming signals through bit-wise parallelism. 

A still further object of the present invention is to process 
over-sampled signals by use of bit-wise parallelism. 

A still further object of the present invention is to use very 
long over-sampled PRN codes efficiently in a bit-wise 
parallel software receiver. 

SUMMARY OF THE INVENTION 

The objects set forth above as well as further and other 
objects are addressed by the present invention. The solutions 
and advantages of the present invention are achieved by the 
illustrative embodiment described herein below. 

The software receiver system and method of the present 
invention enable the efficient execution of a set of algo- 
rithms, that perform software correlation on data sampled 
from incoming channels, on a general purpose processor. 
The system and method of the present invention provide for 
either PRN code storage or computation of PRN codes in 
real-time. PRN code storage is appropriate for PRN codes 
that have short periods, such as the GPS coarse/acquisition 
codes, which are 1023 chips long. In this case, the system 
and method of the present invention pre-compute over- 
sampled replicas of entire PRN code periods and store them 
for orderly and efficient retrieval, such as in a table. This 
table can include a selection of code start times as measured 
relative to the sample times at which RF data are available 
from the receiver front end. There is a separate table for each 
unique PRN code. 

The system and method of the present invention can also 
generate over-sampled versions of the prompt and early- 
minus-late PRN codes in real-time through use of an over- 
sampling function described herein. The values of the over- 
sampling function can be located in a specially designed 
table that can be generic across PRN codes. The length of the 
specially designed table can be independent of the length of 
the PRN code whose replica is being used to process a given 
received signal. The system and method of the present 
invention include techniques for efficiently calculating indi- 
ces into the specially designed table that enable rapid, 
real-time table look-up. 

The system of the present invention includes a software 
correlator that can mix the received signal to baseband, 
compute baseband/PRN correlations through bit-wise par- 
allelism and look-up tables using either the tabulated or 
real-time-generated PRN codes, and compute accumulations 
through bit-wise parallelism and processor instructions or 
look-up tables. Bit-wise parallelism allows the processing of 
multiple data samples simultaneously as the multiple bits of 
a given word of computer data. For example, for 32-bit 
words, the software correlator can process up to 32 samples 
at a time. Bit-wise parallelism can optimally operate when 
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each signal in question can be represented by only a few bits, 
which is normally the case in RF digital signal processing of 
navigation signals. 

The bit-wise parallel operations of the present invention 
5 can save computation time in comparison to integer math- 
ematical correlation operations. If, for example, four accu- 
mulations are required per sample, integer mathematics 
requires six multiplications and four additions per sample 
(except for the last sample). At a sampling rate of, for 
10 example, 5.714 MHz this translates into 57,140 integer 
operations per PRN code period. In the illustrative embodi- 
ment, 33,500 bit-wise parallel operations are necessary per 
PRN code period when the RF signal has a 2-bit represen- 
tation. This operation count is further reduced to approxi- 
15 mately 16,750 bit-wise parallel operations per PRN code 
period when the RF signal has a 1-bit representation. Thus, 
there can be a savings of almost a factor of two to almost a 
factor of four in the operation count. 

The system and method of the present invention also 
20 include a table of pre-computed baseband mixing sine 
waves, algorithms that can produce correlation accumula- 
tion outputs that are equivalent to what would be produced 
by a continuously variable sine wave, and a method of use 
of the table and algorithms. Thus, in the present invention, 
25 a relatively small set of sine wave values need to be 
pre-computed and saved, which can conserve computer 
memory and processing time. 

The present invention also includes a system and method 
for tracking the phase of PRN code replicas in software in 
.to order to track the timing of any given "chip” of the PRN 
code replica as measured with respect to a pre-specified set 
of sample times at which the basic raw data comes out of the 
RF front end (a chip is an element of a PRN code). The PRN 
code phase is kept track of via a variable for each channel, 
35 that indicates the PRN code start time with respect to the RF 
sample times. The system and method of the present inven- 
tion allow for the synchronization of the measurements of 
PRN code phase, carrier phase, and carrier frequency for 
each satellite relative to these sample times. 

40 The method for tracking the phase of each PRN code 
replica and the phase of each carrier replica includes the 
steps of latching all the C/A code phases, carrier phases, 
epoch counters, and carrier frequencies for each satellite at 
a pre-specified time, and computing the pseudo range to 
45 each satellite using the C/A code phase and epoch counters. 
The method also includes the step of tracking and updating 
code and carrier phases by estimating code chipping rate and 
carrier Doppler shift inputs. The method further includes the 
step of computing the code phase at the pre-specified time 
50 for each satellite as a function of the updated code chipping 
rate and the pre-specified time. The method further includes 
the step of computing the carrier phase at the pre-specified 
time as a function of the updated carrier phase, the Doppler 
shift, and the pre-specified time. The timing of the PRN code 
55 phase (or chip location) is the most fundamental of GPS 
measurements for use in navigation data processing. The 
monitoring of these times in software allows complete 
control of the precision with which they can be measured, 
and it allows precise synchronization of these times with the 
60 measurement times of data from other sensors, such as 
inertial measurement units. This feature gives an enhanced 
ability to develop what are known as deeply coupled sys- 
tems that must fuse GPS data with data from other types of 
sensor systems. 

65 The software correlator of the present invention can 
advantageously be easily adapted to accept signals at any 
frequency, new PRN codes, or even signals for different 
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types of devices. Thus, the same processing hardware could 
use the software correlator to implement such devices as a 
GPS receiver, a cell phone, or both. To allow for new codes, 
new frequencies, and new types of functionality, small 
changes can be made in the software correlator, or different 5 
versions of the software correlator can be run on the same 
processor. Hardware-correlator-based receivers of the prior 
art can deal only with frequencies and PRN codes that are 
hard-wired into their designs. Also, the system and method 
of the present invention could be implemented within sys- to 
terns such as GLONASS receivers, cell phones and cell base 
stations, pagers, wireless Ethernet (e.g. 802. llx standards), 
Bluetooth™, Blackberry® wireless internet devices, and 
satellite radio/phones (e.g. INMARSAT®). In fact, the sys- 
tem and method of the present invention are applicable to is 
any sort of telecommunication system/device that uses 
spread spectrum, code division multiple access (CDMA) 
PRN codes for the transmission of information, either wired 
or wireless. 

For a better understanding of the present invention, 20 
together with other and further objects thereof, reference is 
made to the accompanying drawings and detailed descrip- 
tion. The scope of the present invention is pointed out in the 
appended claims. 

25 

DESCRIPTION OF THE SEVERAL VIEWS OF 
THE DRAWING 

FIG. 1 is a schematic block diagram of the hardware 
environment of a typical software receiver; so 

FIGS. 2A and 2B are schematic diagrams of bit-wise 
mappings of signal and carrier replica sign and magnitude 
bits to computer data words; 

FIG. 2C is a graphic representation of a plot of bit-wise 
parallel radio frequency signal and PRN code replica storage 35 
and mixing; 

FIG. 2D is a graphic representation of a plot of sections 
of prompt, early, late, and early-minus-late PRN code sig- 
nals and 16-bit word representations of their over-sampled 
equivalents; 40 

FIGS. 3A and 3B are data flow diagrams illustrating the 
bit-wise parallelism process (replicated twice, once for the 
in-phase carrier replica and once for the quadrature carrier 
replica) of the present invention; 

FIG. 3C is a graphic representation of a plot of a prior art 45 
optimal 2 -bit representation of a sine wave presented to 
enhance the reader’s understanding of the present invention; 

FIGS. 4A and 4B are flowcharts of the method for 
computing correlation accumulations through bit-wise par- 
allel computations of the present invention; 50 

FIG. 5 is a schematic diagram of a look-up table layout as 
a function of code time offset and chip bit pattern; 

FIG. 6 is a graphic representation of a plot illustrating the 
timing relationship between data sample words and the 
sequence of prompt code chips that defines an accumulation 55 
interval; 

FIG. 7 is a flowchart of the method for computing bit-wise 
parallel representations of the over-sampled prompt PRN 
code replica and the over-sampled early-minus-late PRN 
code replica for an entire accumulation interval using the 60 
real-time over-sampled PRN code generation algorithm. 

FIG. 8 is a graphic representation of a plot that illustrates 
the location in time at which the code phase of each signal 
is computed; and 

FIGS. 9A and 9B are graphic representations of plots of 65 
correlations of the true sampled code with prompt (FIG. 9A) 
and early-minus-late (FIG. 9B) versions of the true and table 


look-up codes, the latter being generated by the new real- 
time over-sampled PRN code generator. 

DETAILED DESCRIPTION OF THE 
INVENTION 

The present invention is now described more fully here- 
inafter with reference to the accompanying drawings, in 
which the illustrative embodiment of the present invention is 
shown. The following configuration description is presented 
for illustrative purposes only. Any computer configuration 
satisfying the speed and interface requirements herein 
described may be suitable for implementing the system of 
the present invention. The equations herein are stated in 
general terms, but have parameters that are specific to the 
GPS LI C/A signal for illustrative purposes only. For 
example, the 0.001 sec. accumulation interval seen in many 
of the equations is the nominal C/A code period. Also, the 
C/A PRN code of the illustrative embodiment can be 
replaced by the PRN code of any other CDMA signaling 
system. 

By way of introductory explanation, RF signal processing 
equations and terms are herein provided. The time-domain 
LI C/A signal received from, for example, a satellite, is 
represented by: 


yui)~ V AjDjtck.mi '' Tjt ) 

I \ T jk+l ~ T jk ). 


COS[cOifti - (f)j(tj )] + flj 


( 1 ) 


where t, is the sample time, A • is the amplitude, D ]k is the 
navigation data bit, Cy[t] is the C/A code, x jk and x jk+1 are the 
start times of the received k' 7 ' and k+U C/A code periods, 
to f is the intermediate frequency corresponding to the LI 
carrier frequency, <j> y (t,) is the carrier phase perturbation due 
to accumulated delta range, n y is the receiver noise, and the 
subscript j refers to a particular GPS satellite. The summa- 
tion is over all visible GPS satellites. The negative sign in 
front of 4>( t,) comes from the high-side mixing that occurs in 
the RF front-end that has been used in the illustrative 
embodiment. The signal in equation (1) is the output of a 
typical RF front end. 

A GPS receiver works with correlations between the 
received signal and a replica of it. The correlations are used 
to acquire and track the signal. The replica is composed of 
two parts, the carrier replica and the C/A PRN code replica. 
Two carrier replica signals are used, an in-phase signal and 
a quadrature signal. When mixed with the received signal 
from the RF front end they form the in-phase and quadrature 
baseband mixed signals represented by: 


yijdi) = cjo.ooif ^ f ' T * )| 

L \ T jk + 1 ~ T jk) \ 

cos(«,rt - [$ jk +w DwjI (f, -fjj)]} 

sinjw/rf - [<j> jk + OiDoppjk Pi -?>)]} 


(2) 


(3) 


where equations (2) and (3) apply during the k r, ‘ C/A code 
period. In these equations x jk and x jk+1 are the receiver’s 
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estimates of the start times of the k r,! and k+D' code periods, 
<jp j k is the estimated carrier phase at time x k , and (o D - k is 
the estimated carrier Doppler shift during the k'* code 
period. 

A typical receiver computes the estimates x jk , x jk+1 , 
and to Doppjk by various conventional means that are 
described in GPS Receivers, A. J. Van Dierendonck, Global 
Positioning System: Theory and Applications, B. W. Parkin- 
son and J. J. Spilker, Jr., Eds., vol. I, American Institute of 
Aeronautics and Astronautics, 1996, Chapter 8, pp. 329-406 
(Dierendonck), incorporated herein in its entirety by refer- 
ence. These include open-loop acquisition methods and 
closed-loop signal tracking methods such as a delay-locked 
loop to compute x k and x k+1 and a phase-locked loop or a 
frequency-locked loop to compute § jk and v> Doppjt . The 
software receiver developed herein uses conventional tech- 
niques for forming these estimates. 

Both prompt and early-minus-late correlations are needed 
to track the carrier frequency, carrier phase, and code phase 
in a GPS receiver. A typical receiver uses the PRN code and 
carrier replicas to compute the following in-phase and 
quadrature correlation accumulations: 


teij L v T jk+l - Xjk 

COs(o ), F ti - [</>j t + tb D pppjk ft -fj k )]} 

F ( r,- + A - r.i )1 

Qjk( A) = - V y(ti)Cj 0.001 J A 

fcf k L \Tjt+l -Tj k 

sinjw; F r; - [lp jk + io D ppp jk (/,• - f Jk )]} 


where i^. is the index of the first RF front-end sample time 
that obeys x jk =X ik and N^+l is the total number of samples 
that obey Xj k =l i <x ]k+1 . The time offset A causes the replica 
PRN code to play back early if it is positive and late if A is 
negative. The prompt correlations are defined by equations 
(4) and (5) with A=0. The early-minus-late correlations are 
U( A ^' /2 )"W _A ^' /2 ) and ^jtrl A e m i/2)-Q jk (-A em/ /2), where 
A eml is the spacing between the early and late PRN carrier 
replicas. The present invention described herein is an effi- 
cient technique for the receiver to accumulate \ jk and Q jk in 
software. 

Referring now to FIG. 1, the operational platform of the 
software receiver 10 of the present invention includes an 
antenna 11, conventional RF front-end 13, a data acquisition 
(DAQ) system 17, a microprocessor 16, a software correla- 
tor 19, and application-specific code 15. Conventional RF 
front-end 13 interfaces with antenna 11 and with (DAQ) 
system 17. DAQ system 17 includes a system of shift 
registers and a data buffer. Microprocessor 16 executes 
software correlator 19, which includes a set of specially 
developed bit-wise parallel algorithms, and application- 
specific code 15, such as the GPS navigation and tracking 
functions. In the illustrative embodiment, conventional GPS 
software functions (signal tracking, data extraction, naviga- 
tion solution, etc.) are provided by the MITEL® GPS 
Architect software ported to RTLINUX® (see A Coming of 
Age for GPS: A RTLINUX BASED GPS RECEIVER, 
Ledvina et al.. Proceedings of the Workshop on Rea l Time 
Operating Systems and Applications and Second Real Time 
Linux Workshop (in conjunction with IEEE RTSS 2000) Nov. 
27-28, 2000), but can be provided by any equivalent con- 
figuration. 
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Continuing to refer to FIG. 1, conventional RF front-end 
13 can, for example, be a MITEL® GP2015 RF front-end, 
which down converts the nominal 1.57542 GHz GPS signal 
12 to an intermediate frequency of (88.54/63)xl0 6 
5 Hz=l .4053968254 MHz and then performs analog-to-digital 
conversion. The resultant, digitized signal data 21 has a 
pre -determined number of bits/sample, such as two binary 
bits/sample, a sign bit and a magnitude bit, or one bit/ 
sample. The shift registers in the DAQ system 17 parallelize 
10 the magnitude and sign data bit streams into separate words, 
which the DAQ system 17 reads into the memory of 
microprocessor 16 using DMA. To make the process of 
reading data into the microprocessor 16 more efficient and to 
prepare for efficient correlation calculations, DAQ system 
15 17 can read a pre-specified number of bits of buffered 
samples, such as thirty-two bits, at a time. The exemplary 
thirty-two bits include sixteen sign bits and sixteen magni- 
tude bits. 

Referring now to FIGS. 1 and 2A, the shift registers in 
20 DAQ system 17 (FIG. 1) buffer signal data 21 (FIGS. 1 and 
2A) and pack signal sign 21A (FIG. 2A) and signal magni- 
tude 21B (FIG. 2A) into separate words, that represent the 
integer values ±1 and ±3 as is shown in Table 1. In the case 
of a 1-bit signal, the bit stream representing the samples is 
25 packed into successive words to prepare the signal for 
bit-wise parallel processing. DAQ system 17 also provides 
for accurate timing by, for example, synchronizing signal 
sign 21A and signal magnitude 21B to a (40/7)xl0 6 
Hz»5.714 MHz clock signal, which can be, for example, a 
30 third output from conventional RF front-end 13 (FIG. 1). 
DAQ system 17 can convert the 5.71424 MHz clock signal 
down to 357.14 KHz by use of, for example, a divide-by-16 
counter for a 16-bit word, which can provide a signal 
indicating when the buffer is full. DAQ system 17 can use 
35 any method for providing a buffer full indication. 


TABLE 1 


The sign and magnitude combinations of the sample RF output of the 
conventional RF front-end and their corresponding values. 

Signal Sign 21A 

Signal Magnitude 21B 

RE Signal Value 

0 

0 

-1 

0 

1 

-3 

1 

0 

+1 

1 

1 

+3 


With further reference to FIG. 1, in the illustrative 
embodiment, the DAQ system 17 can consist of an interface 
50 card and driver software that can be compatible with, for 
example, a 1.73 GHz AMD ATHLON processor running 
RTLINUX®, but could be compatible with any operating 
system and any processor that can accommodate real-time 
operations. The interface card can, for example, be a 
55 NATIONAL INSTRUMENTS® PCI-DIO-32HS digital I/O 
card. Pertinent features of this card are the thirty-two digital 
input lines, DMA, and availability of a driver for 
RTLINUX®, perhaps gotten from the suite of open source 
drivers and application interface software for interface cards 
60 known as COMEDI (COntrol and MEasurement and Device 
Interface). Modifications to the conventional COMEDI 
driver for the PCI-DIO-32HS card include increasing the 
number of input bits from sixteen to thirty-two, enabling 
DMA, and modifying the driver to support continuous 
55 interrupt -driven acquisition. 

With still further reference to FIG. 1, microprocessor 16 
can be, for example, a 1.73 GHz AMD ATHLON™ proces- 
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sor running the RTLINUX® operating system, but any 
operating system and processor that can accommodate real- 
time operations can be used. Low latency interrupt respon- 
siveness, the ability to execute threads at regular intervals, 
with the kernel having a possibility of being the lowest 
priority thread, and reliable execution of time-critical code 
are among features of an operating system that could 
enhance the performance of the system of the present 
invention. The use of RTLINUX® is presented herein for 
illustrative purposes only. 

Continuing to refer to FIG. 1, analogous to a hardware 
correlator that takes input directly from the RF front end in 
serial fashion, software correlator 19 reads from a shared 
memory buffer that both software correlator 19 and DAQ 
system 17 can access, the former to read data, and the latter 
to write data. The shared memory buffer can be implemented 
as a DMA memory space and a circular buffer. In the 
illustrative embodiment in which the system and method of 
the present invention are used in a GPS (or similar) envi- 
ronment, microprocessor 16 can store the most recent 
twenty-one milliseconds of signal data 21 (FIGS. 1 and 2 A) 
in the circular buffer, but could store more or less. The 
present invention does not fix the size of the circular buffer, 
nor the amount of RF data that can be stored there. The 
circular buffer allows the processing of code periods that 
start and stop at different times for different satellites during 
different iterations of a regularly scheduled program thread. 
DMA memory space can be written to directly by DAQ 
system 17 using a DAQ software driver, which fills the 
circular buffer. Communication between software correlator 
19 and application-specific code 15 can be performed using 
operating system-provided shared memory capability. For 
example, the mbuff driver, included with RTLINUX®, can 
be used to create and manage this shared memory space. 
Any memory management system that accommodates real- 
time processing can be used. If the mbuff driver is used, 
kernel modules can share memory and the kernel can be 
restricted from swapping the shared memory space to long- 
term storage. 

Continuing with the analogy to hardware correlation, and 
still referring primarily to FIG. 1, in hardware correlation, 
the correlator receives frequency and phase information 
from tracking and acquisition loops that are part of appli- 
cation code, and Numerically Controlled Oscillators (NCOs) 
generate signals that correspond to the written frequencies 
and phases. In contrast, software correlator 19 includes 
simulated carrier and code NCOs that receive their fre- 
quency commands from application-specific code 15. Soft- 
ware correlator 19 uses these frequency commands to recon- 
struct carrier replica signal 25 (FIG. 3A) and prompt PRN 
code 29 and early-minus-late PRN code 35 (FIG. 3A) which 
it mixes with the signal data 21 (FIG. 2A) resulting in fully 
mixed prompt integrand 31 and fully mixed early-minus-late 
integrand 33 (FIG. 3A). 

To further continue the analogy, a hardware correlator 
generates in real-time a particular C/A code replica at the 
correct Doppler shifted frequency and phase. In contrast, 
software correlator 19 can generate C/A codes off-line and 
store them in a memory table, the pre-computed over- 
sampled PRN code table 28 (FIG. 3A). The pre-computed 
over-sampled PRN code table 28 is used to select PRN codes 
with the correct timing relationship to the sample times of 
signal data 21 (FIG. 3A). The codes are then used to form 
correlations with baseband mixed signals 23 (FIG. 3A), the 
result from which is summed to produce the standard 
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in-phase and quadrature, summed prompt accumulation 45 
(FIG. 3B) and summed early-minus-late accumulation 47 
(FIG. 3B) that are equivalent to what would be produced by 
a continuously variable sine wave. These are provided to 
5 application-specific code 15, such as conventional GPS 
software that executes signal tracking and navigation func- 
tions. In a second approach, software correlator 19 can 
generate the PRN carrier replicas on-line at the code chip- 
ping rate and can use tabulated functions to re-sample the 
10 code at the sample rate of the RF front-end for purposes of 
calculating accumulations. Real-time over-sampled PRN 
code generator 30A (FIG. 3A) is used in place of pre- 
computed over-sampled PRN code table 28 (FIG. 3A) in this 
15 latter approach. This latter method can be used with longer 
PRN codes, such as the new civilian GPS L2 CL codes. 

With still further reference to FIG. 1, since the received 
LI raw signal 12 can have an uncertain carrier phase, 
software correlator 19 computes both in-phase (I) and 
20 quadrature (Q) accumulations, as defined in equations (4) 
and (5). Software correlator 19 begins the accumulation 
process by using carrier replica signal 25 (FIG. 3A), which 
it gets from pre-stored carrier replica table 30 (FIG. 3A). The 
carrier replicas in this table fall on a rough frequency grid, 
25 and they all start with a particular phase, for example a phase 
of zero. The baseband mixing process involves selecting a 
carrier replica signal 25 (FIG. 3A) from carrier replica table 
30 (FIG. 3A) that is at the frequency that is as close to 
“ideal” as possible. In the case of a 175 Hz grid spacing, the 
baseband mixing process selects a signal that is maximally 
within ±87.5 Hz of the ideal signal. The rough frequency 
grid can have a spacing of, for example, 175 Hz but could 
be larger or smaller depending on (a) the frequency range 
35 needed to cover, for example, ±10 Khz, (b) the amount of 
space available for storing pre-computed signals, and (c) 
other design decisions. The pre-computed signals in carrier 
replica table 30 (FIG. 3A) each may occupy 180 32-bit 
words in order to be guaranteed to cover the full 5,714 RF 
40 front-end samples that occur in one PRN code period for any 
possible code period start time within the thirty-two samples 
of the initial word. Thus, 180*4=720 bytes could be required 
for each bit of each pre-computed carrier replica signal 25 
that is stored in the table. The sine and cosine waves of 
45 carrier replica signals 25 (FIG. 3A) each have 2-bit repre- 
sentations, which translates into a storage requirement of 
2880 bytes for the carrier replica signals 25 at a given 
Doppler shift. There are 115 Doppler shifts that may be 
stored in order to cover the -10 KHz to +10 KHz range with 
: ’ 0 a 175 Hz grid spacing. This translates into 323 Kbytes of 
storage for all of the carrier replica signals 25. This approach 
avoids the need to pre-compute sine waves with a prohibi- 
tively large number of possible frequencies and phase offsets 
and it avoids the need to compute sine waves in real-time. 
” Instead, the errors created by using pre-defined sine wave 
replicas are compensated for by post-processing calcula- 
tions, as described below. 

In any case, and continuing to refer to FIG. 1, the resulting 
accumulations are 

60 

‘k +N k r ii (Q 

l,M A)= X 0.00l|2 

7“" L \Tjk+l-Tjk !\ 

' ‘k 

65 

COS[(£D//r — (*)gjk){ti ~ 
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-continued 


•k +N k 

Q gjt (A) = - ^ y(r,')clo 


W ?i+A -^)1 

(7) 

( sin(7rA/T)'\ 
A5W = 201og 10 ( ) 

(12) 

L V TjA+i -TjkJ] 
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sin[(£o, F -WjjiXr, -(ojji)] 


where w k is the grid frequency that is closest to the 
estimated frequency <s> D jk and where t 0gl - k is the time at 10 
which this carrier replica signal 25 (FIG. 3A) has zero 
carrier phase. Software correlator 19 rotates these accumu- 
lations in order to create accurate approximations of what 
would have been computed had the estimated carrier phase 
time history in equations (4) and (5) been used: 15 

/ /t( A H^*( A )“»(4+.v S ^+O g (it(A)8m(A+ BVS[ , 1 ) (8) 

Syt(A)=-/ s , t (Ajsin(A(|) ol , s ,. lt )+e s/ll (A)cos(A<t> ovs/lt ) (9) 

where A<|) avg;7t is the average phase difference between the 20 
grid carrier phase and the estimated carrier phase averaged 
over the accumulation interval: 


. , (Tjk + Tjk+l ) . (10) 25 

A 9avgjk ~ ° J gjk [ 2 ) ' - !k 

( Tjk + 1 “ Tjk ) 
a Doppjk\ ~ 


Note that equations (8), (9), and (10) are an illustrative 
example of how software correlator (19) can rotate its I and 
Q accumulations in order to correct for phase and frequency 
errors in its table of pre-computed carrier replica signals. 
There exist other formulas that yield equivalent results, and 35 
this patent disclosure covers all such techniques. 

The validity of equations (8) and (9) is dependent on the 
assumption that 


1 COS 2 ^gjk ko Doppjk i (Tjk+l g jk i 1 


For example, a 175 Hz grid spacing and a nominal C/APRN 45 
code period of 0.001 sec yields a value on the left-hand side 
of inequality (11) of 0.04, which respects the assumed limit. 

Note that equations (8) and (9) can be derived from 
equations (4) and (5) as follows. First, the carrier phase of 
the grid signal in the arguments of the cosine and sine terms 50 
of equations (6) and (7) are added to and subtracted from the 
arguments of the cosine and sine terms in equations (4) and 
(5). Next, trigonometric identities are used to split the 
resulting cosine and sine terms into sums of products of 
cosine and sine functions. In each product, one of the terms 55 
involves an argument like the arguments in the trigonometric 
terms in equations (6) and (7). The other trigonometric terms 
are then approximated by either cos(Ac|> avg(7t ) or sin(A(() avg(7t ). 
These approximations are valid because of the inequality in 
equation (11) and because the average of sin {(oi^- 60 
^Doppjkfcr 1/ ' 2 ('tjk +x jk +i)]} over the accumulation interval is 
zero. 

A decrease in the carrier to noise ratio C/N 0 , which 
characterizes the receiver’s sensitivity, is caused by the use 
of an inexact baseband mixing frequency. The worst-case 65 
decrease is expressed as a function of the frequency grid 
spacing Af and is given by 


where Af is in units of Hz, and T is the integration period. 
Thus, a Af of 175 Hz causes a worst -case C/N 0 loss of 0.11 
dB for T=0.001 sec. 

Referring now to FIGS. 2A, 2B, and 3A, PRN codes 
(composed of prompt PRN codes 29 (FIG. 3A) and early- 
minus-late PRN codes 35 (FIG. 3A)) are either pre-com- 
puted or generated in real-time. Pre-computing involves, for 
each satellite, computing an entire PRN code, storing the 
PRN code appropriately for easy retrieval, and referencing 
the PRN code, possibly by means of indices that are com- 
puted based on, for example, the incoming RF signal data 21 
(FIGS. 2A and 3A). Pre-computing can be most advanta- 
geously used when the PRN code is not very long. Gener- 
ating PRN codes in real-time can be a more appropriate 
solution when the PRN codes are very long (and thus would 
require an unacceptable amount of storage), or perhaps when 
too many PRN codes are required for the amount of storage 
available, or for any other reason, but real-time PRN code 
generation can entail an additional computational cost. Both 
pre-computing and real-time determination of PRN codes 
are described herein with respect to a bit-wise parallel 
implementation. 

Continuing to refer primarily to FIGS. 2A, 2B, and 3A, in 
order to perform bit-wise parallel operations, software cor- 
relator 19 (FIG. 1) stores pre-computed carrier replica sign 
25A (FIGS. 2B and 3A) and carrier replica magnitude 25B 
(FIGS. 2B and 3A) in data words. Simple representations of 
signal data 21 (FIGS. 2A and 3A) and carrier replica signal 
25 (FIGS. 2B and 3A) in terms of one, two, or more bits are 
suitable for using bit-wise parallelism to perform the calcu- 
lations described herein. Bit-wise parallel operations work 
with representations of the data that store successive 
samples in successive bits of a word. For example, thirty- 
two samples (bits) of the RF front-end output are stored in 
two N=32-bit words, signal sign 21A(FIGS. 2Aand 3A) and 
signal magnitude 21B (FIGS. 2A and 3A), or simply a single 
32-bit word if signal data 21 consists of a single data bit. 
Carrier replica sign 25A (FIGS. 2B and 3A) and carrier 
replica magnitude 25B (FIGS. 2B and 3A) are stored, for 
example in tables, in separate words, with each 32-bit word 
storing thirty-two sign or magnitude bits that tabulate to 
thirty-two successive samples of the corresponding cosine or 
sine wave. Similarly, tables can store prompt PRN code 29 
(FIG. 3A) and early-minus-late PRN code 35 (FIG. 3A), 
which are composed of prompt PRN code sign 29A (FIG. 
3A), early-minus-late PRN code sign 35A (FIG. 3A), and 
early-minus-late PRN code zero mask 35B (FIG. 3A). The 
data words that comprise the bit-wise parallel representa- 
tions of these three signal types, the original RF signal data 
21 (FIGS. 2A and 3A), the carrier replica signal 25 (FIGS. 
2B and FIG. 3A), and the de-spreading prompt PRN code 29 
(FIG. 3A) and early-minus-late PRN code 35 (FIG. 3A), are 
the inputs to the calculations of software correlator 19 (FIG. 
1 ). 

Further continuing primarily to refer to FIGS. 2A, 2B, and 
3A, many intermediate calculated quantities and at least 
three types of intermediate signals are also stored in bit-wise 
parallel format. First there are the in-phase and quadrature 
baseband mixed signals 23 (FIG. 3A), whose 3-bit repre- 
sentations for the illustrative embodiment are stored as 
baseband mixed sign 23A (FIG. 3A), baseband mixed high 
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magnitude 23B (FIG. 3A), and baseband mixed low mag- 
nitude 23C (FIG. 3A). The second bit-wise parallel signal 
type is the fully mixed integrand, of which there are four 
signals: in-phase and quadrature fully mixed prompt inte- 
grand 31 (FIG. 3A) and in-phase and quadrature fully mixed 
early-minus-late integrand 33 (FIG. 3A). The former are 
stored as 3-bit representations in the illustrative embodiment 
as fully mixed prompt integrand sign 31A (FIG. 3A), fully 
mixed prompt integrand high magnitude 31B (FIG. 3A), and 
fully mixed prompt integrand low magnitude 31C (FIG. 
3A). The latter are stored as 3.5-bit representations in the 
illustrative embodiment as fully mixed early-minus-late 
integrand sign 33A (FIG. 3A), fully mixed early-minus-late 
integrand high magnitude 33B (FIG. 3A), fully mixed 
early-minus-late integrand low magnitude 33C (FIG. 3A), 
and fully mixed early-minus-late integrand zero mask 33D 
(FIG. 3A). This representation is called a 3.5-bit represen- 
tation because the sign, high-magnitude, and low-magnitude 
bits are ignored if the corresponding zero mask bit has the 
value zero. The third bit-wise parallel signal type is a value 
word, of which there are two types: prompt integrand value 
words 27 (FIG. 3B) and early-minus-late integrand value 
words 37 (FIG. 3B). Each fully mixed integrand is used to 
construct value words, one word for each possible value that 
the integer integrand can take on. There are eight possible 
values for the integrands of the illustrative embodiment: -1, 
-2, -3, -6, 1, 2, 3, and 6 for the in-phase and quadrature 
fully mixed prompt integrands 31 (FIG. 3A) and -2, -4, -6, 
-12, 2, 4, 6, and 12 for the in-phase and quadrature fully 
mixed early-minus-late integrands 33 (FIG. 3A). Each bit- 
wise parallel value word contains a one bit for each sample 
time when the integrand value equals the value of the value 
word, but it contains a zero bit for all other sample times. 
The storage of raw data and intermediate results in bit-wise 
parallel format allows the EXCLUSIVE OR operations that 
are involved in mixing to operate on thirty-two samples at a 
time if microprocessor 16 (FIG. 1) has a bit-wise EXCLU- 
SIVE OR command. Other bit-wise commands are used to 
perform additional software correlation operations in paral- 
lel on sets of two thirty-two samples. 

At this point, the problem of over-sampling is introduced. 
Referring now to FIGS. 2C and 2D, the problem of over- 
sampling is illustrated with respect to bit-wise parallelism as 
follows. There is normally more than one RF data sample 
per PRN code chip. The three successive -1 values 73 (FIG. 
2C) at sample times U to t 3 all occur during the same PRN 
code chip as do the four successive +1 values 75 (FIG. 2C) 
at times t 4 through t 7 . The difference in the number of 
samples for the two code chips arises because the PRN code 
chip period is not an integer multiple of the sample period. 
Analogously, referring to FIG. 2D, where sample interval 
At 63 is less than actual PRN code chip length At c 65, 
over-sampling is indicated because the RF sampling fre- 
quency f^xl/AL is greater than the PRN code chipping 
frequency f e =l/At c . PRN codes for CDMA signaling are 
sequences of +1 and -1 values, the elements of which are 
chips. Over the time intervals of interest, a carrier replica 
progresses through its chips at a constant chipping rate of 
f c =l/At c chips/second. The time interval At c is the actual 
PRN code chip length 65 (FIG. 2D). Software correlator 19 
(FIG. 1) normally receives PRN code, and attempts to align 
it with the prompt replica version of the code, prompt PRN 
code 29 (FIG. 2D). It makes use of the signal’s correlation 
with prompt PRN code 29 (FIG. 2D) and with early-minus- 
late PRN code 35 (FIG. 2D) in order to determine a chipping 
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rate f c that tends to align prompt PRN code 29 (FIG. 2D) as 
desired. Conventional methods for determining f c are well- 
known in the art. Chips of early code 69B (FIG. 2D) start 
and stop 0.5At em/ seconds before the corresponding chips of 
5 prompt PRN code 29 (FIG. 2D), and the chips of late code 
69C (FIG. 2D) start and stop 0.5At em/ seconds after prompt 
PRN code 29 (FIG. 2D). Early-minus-late PRN code 35 
(FIG. 2D) is the difference between early code 69B (FIG. 
2D) and late code 69C (FIG. 2D). Example segments of 
10 these four types of replica codes are depicted in FIG. 2D. 

Referring to FIGS. 1, 2A, 2C, and 2D software correlator 
19 (FIG. 1) receives, through conventional RF front end 13 
and DAQ system 17, signal data 21 (FIG. 1), the raw data 
15 12 (FIG. 1) source of which is sampled at the rate f s =l/At s 
Hz. In order to process the resulting RF signal data 21, 
software correlator 19 (FIG. 1) needs prompt PRN code 29 
(FIG. 2D) and early-minus-late PRN code 35 (FIG. 2D) 
replicas sampled at the same times as raw signal 12 (FIG. 1). 
20 FIG. 2D depicts sixteen sample times as vertical dash-dotted 
lines. Referring to FIG. 2D, prompt PRN code 29 (FIG. 2D) 
can be represented by its prompt PRN code sign 29A (FIG. 
2D) at the sample times. The bit value one represents +1, and 
the bit value zero represents -1. Prompt PRN code sign 29A 
25 (FIG. 2D), shown at the sixteen sample times — starting with 
three Is, continuing with ten 0s, and finishing with another 
three Is — is a 16-bit word stored as the integer 
2 ls +2 14 +2 :13 +2 2 +2 :1 +2 0 =57351. Early-minus-late PRN code 
35 (FIG. 2D) requires a 1.5-bit representation. A zero mask 
bit is set to zero if early-minus-late PRN code 35 takes on 
the value zero, and it is set to one if early-minus-late PRN 
code 35 equals +2 or -2. Early-minus-late PRN code zero 
mask 35B (FIG. 2D) at sixteen sample times shown in FIG. 
35 2D is equivalent to 2 1 '’+2 12 +2 2 =12292. A 2’s sign bit is set 
to one if early-minus-late PRN code 35 (FIG. 2D) equals +2 
at the sample time, and it is set to zero if the code equals -2. 
The 2’s sign bit is irrelevant if the corresponding early- 
minus-late PRN code zero mask 35B (FIG. 2D) bit equals 
40 zero. Early-minus-late PRN code sign 35A (FIG. 2D) for 
sixteen sample times contains X values that indicate bits 
whose values are irrelevant because the corresponding early- 
minus-late PRN code zero mask35B (FIG. 2D) bits are zero. 
In an illustrative embodiment, all the X values become zero, 
45 thus the equivalent integer for early-minus-late PRN code 
sign 35A (FIG. 2D) is 2 2 =4. 

Continuing to refer to FIG. 3 A, an alternative to taking the 
prompt PRN code 29 and early-minus-late PRN code 35 
from pre-computed over-sampled PRN code table 28 is to 
50 generate prompt PRN code sign 29A, early-minus-late PRN 
code sign 35A, and early-minus-late PRN code zero mask 
35B using real-time over-sampled PRN code generator 30A 
(FIG. 3A). Shown in FIG. 3A are two circles and a loose 
arrow with a quarter circle pointer. These are the symbols for 
55 a switch and indicate the ability of the system to choose 
possible alternate sources of PRN code. Using the real-time 
over-sampled PRN code generator 30A includes a step of 
generating the PRN code chips in real-time by conventional 
means. For example, the GPS civilian L2 CL and CM codes 
60 are generated by a 27-bit feedback shift register (see The 
NewLl Civil Signal, R. D. Fontana et al.. Proceedings of the 
ION GPS 2001, Sep. 11-14, 2001, Salt Lake City, Utah, pp. 
617-631). The method further includes the steps of choosing 
chip values from the PRN code, where the chip values 
65 correspond to a data interval that contains the samples of a 
data word and where the chips have a known timing relative 
to the data interval, transforming the relative timing into a 
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time grid index, and translating the PRN code chip values 
and the time grid index for the data interval into the PRN 
code’s over-sampled bit-wise parallel format. These latter 
steps can be carried out efficiently by using a table look up 
function. One table each for prompt PRN code sign 29A, 
early-minus-late PRN code sign 35A, and early-minus-late 
PRN code zero mask 35B can include integer values that 
constitute the bit-wise parallel representation of the PRN 
code for the sample times associated with the data word in 
question. Indices into each 1-dimensional table are functions 
of (a) the time offset between the first PRN code chip and the 
first sample time of the given data word, and (b) the bit 
pattern of the PRN code chips that span the sample times of 
the data word. The sizes of the tables are independent of the 
period of the PRN code that is being over-sampled. The 
tables can be re-used for multiple PRN codes in a multi- 
channel receiver. The computation and use of the tables are 
discussed in more detail later. 

Continuing with the description of bit-wise parallelism 
with respect to the operations of software correlator 19 (FIG. 
1), and continuing to refer to FIG. 3A, the specially- 
developed algorithms described herein make use of bit-wise 
parallelism so that a single programming language state- 
ment, such as a C code command, can partially process up 
to thirty-two samples at a time. Previously referred-to carrier 
replica signal 25 in the form of cosine and sine signals are 
stored as binary carrier replica sign 25A and carrier replica 
magnitude 25B. The format of this representation is defined 
in Table 2 and illustrated in FIG. 3C which is a reconstructed 
carrier and carrier replica in the form of representative sine 
signal 51 (FIG. 3C) shown in optimal 2-bit representation 53 
(FIG. 3C) that has the minimum square error. The format of 
Table 2 assumes that the cosine and sine signals have an 
amplitude of approximately 2.4. Note that other represen- 
tations, beyond 2-bit representation, are possible. In general, 
more bits yield a better SNR, but can also require a larger 
number of computations for the correlation operations. 


TABLE 2 


Sign and magnitude combinations of the stored intermediate- 
frequency baseband mixing carrier sine wave replicas 
and the values that they represent 

Carrier Replica Sign 
25A 

Carrier Replica 
Magnitude 25 B 

Carrier Replica Value 

0 

0 

-1 

0 

1 

-2 

1 

0 

+1 

1 

1 

+2 


Continuing to refer to FIG. 3A, multiplication of the RF 
front-end output representation, the signal sign 21A and 
signal magnitude 21B, of Table 1 by the sine wave repre- 
sentation, carrier replica sign 25A and carrier replica mag- 
nitude 25B, of Table 2 yields baseband mixed signals 23, 
consisting of baseband mixed sign 23A, baseband mixed 
high magnitude 23B, and baseband mixed low magnitude 
23C, that can take on the values - 6 , - 3 , - 2 , - 1 , + 1 , + 2 , + 3 , 
and + 6 , as shown in Table 3 . Baseband mixed high magni- 
tude 23B is simply signal magnitude 21B, and baseband 
mixed low magnitude 23C is carrier replica magnitude 25B. 
Thus, these two magnitude bits are available without the 
need for computation. Baseband mixed sign 23Ais the result 
of an EXCLUSIVE OR operation between signal sign 21A 
and carrier replica sign 25A. Notice how the relationship of 
the sign bit value with the actual sign gets reversed from that 
of Tables 1 and 2 . 


TABLE 3 


Sign, high- magnitude, and low-magnitude combinations 
of the baseband- mixed signal and their 
corresponding signal values. 


Baseband 
Mixed Sign 
23A 

Baseband Mixed 
High Magnitude 
23 B 

Baseband Mixed 
Low Magnitude 
23C 

Baseband Mixed 
Value 

0 

0 

0 

+1 

0 

0 

1 

+2 

0 

1 

0 

+3 

0 

1 

1 

+6 

1 

0 

0 

-1 

1 

0 

1 

-2 

1 

1 

0 

-3 

1 

1 

1 

-6 


Continuing to refer to FIG. 3A, and continuing to describe 
the bit-wise parallel algorithms, the required amount of 
20 storage for tables of pre-computed prompt PRN code 29 and 
early-minus-late PRN code 35 can be greatly reduced by 
making two simplifications. First, the prompt PRN code 29 
is stored as prompt PRN code sign 29A. This representation 
is shown in Table 4. The early-minus-late PRN code 35, on 
25 the other hand, is stored in a two-bit representation (actually 
a 1.5 bit representation): early -minus-late PRN code sign 
35A and early-minus-late PRN code zero mask 35B, as 
denoted in Table 5. Note that the X in the first column of 
Table 5 indicates that zero or one can be placed in this 
.to location without affecting the corresponding code value. The 
X signifies a lack of effect of the sign bit on the code value 
when the zero mask bit equals zero. This is why the 
early-minus-late PRN code 35 representation is referred to 
as a 1.5-bit representation. This X value will affect the 
35 corresponding fully mixed early-minus-late integrand sign 
33A, but it will not affect any of the early-minus-late value 
words because the zero value in the corresponding zero 
mask location will null out the corresponding bit of all 
early-minus-late value words. 

40 

TABLE 4 


Sign bits of the prompt C/A code and the corresponding 
prompt signal values. 

45 Prompt PRN Code 

Sign 29A Prompt Code Value 


1 +1 

0 -1 


TABLE 5 


Sign and zero mask bit combinations of the early-minus-late 

PRN code 35 and the corresponding signal values. 

55 

Early-minus- late PRN Early-minus-late PRN Early-Minus-Late Code 
Code Sign 35A Code Zero Mask 35B Value 


X 0 0 

0 1-2 


Another simplification in the pre-computed over-sampled 
PRN code table 28, and continuing to refer to FIG. 3A, can 
be to ignore code Doppler shift variations. All signals in the 
65 table are assumed to have zero Doppler shift; i.e., all C/A 
codes in the table assume that x j/c+l -x j/c =().00\ sec. Note that 
the period of 0.001 is applicable for accumulations that use 
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the full 1023 chips of the C/A code only. Any other type of 
code or accumulation interval may have a different period. 
The code phase errors due to this assumption can be elimi- 
nated by choosing a replica code from the pre-computed 
over-sampled PRN code table 28 whose midpoint occurs at 5 
the desired midpoint time (x J - k +x ]k+1 )/2. The only other effect 
of this assumption can be a small correlation power loss, 
which is no more than 0.014 dB if the magnitude of the 
Doppler shift is less than 10 KHz. The pre-computed over- 
sampled PRN code table 28 should include a selection of 10 
different phases, for example fourteen, as measured relative 
to a signal sample spacing of, for example, 175 nsec. This 
translates into a code phase spacing of, for example, 12.5 
nsec, which equals a pseudo range measurement digitization 
level of 3.8 m, or a maximum measurement error of 1.9 m. 25 
The number of phases in the pre-computed over-sampled 
PRN code table 28 is dependent upon the design of the 
system and no set number of phases is required by the 
present invention. Referring to FIG. 6, suppose that pre- 
computed over-sampled PRN code table 28 stores over- 20 
sampled bit-wise parallel representations of chips C(l) 
through C(M). The table must allow for the retrieval of 
over-sampled bit-wise parallel code replicas for a range of 
start times of code chip C(l) that span the entire first data 
sample word in the accumulation interval W 2 95 (FIG. 6). 25 
The table may contain code replicas whose different phases 
yield start times that span only a single sample interval of 
data word Wj 95 (FIG. 6), which is only 1/a, of the required 
number of start times. In this case the software correlator 
may apply bit shift operations to a tabulated PRN code - 50 
replica from that sample interval in order to generate the 
over-sampled bit-wise parallel PRN code replica that applies 
when chip C(l) starts in a different sample interval of data 
word Wj 95 (FIG. 6). 

Continuing to refer to FIG. 3A, and further continuing to 35 
describe the bit-wise parallel algorithms, prompt PRN code 
29 and early-minus-late PRN code 35 replicas can be mixed 
with the baseband mixed signals 23 to form fully mixed 
prompt integrand 31 by an EXCLUSIVE OR operation and 
bit re-definitions. An EXCLUSIVE OR between prompt 40 
PRN code sign 29 A and baseband mixed sign 23 A produces 
fully mixed prompt integrand sign 31A given in Table 6. The 
fully mixed prompt integrand high magnitude 3 IB and fully 
mixed prompt integrand low magnitude 31C are baseband 
mixed high magnitude 23B and baseband mixed low mag- 45 
nitude 23C, also given in Table 6. Note that the Table 6 
representation is identical to that of Table 3 except for the 
inversion in the meaning of the sign bits. The number of 
magnitude bits is dependent upon the design of the system 
and no set number of magnitude bits is required by the 50 
present invention. A change in the number of magnitude bits 
will cause a change in the number of entries of the equiva- 
lent of Table 6 and it will affect the possible values of the 
integrand. 


TABLE 6 


Sign, high- magnitude, and low-magnitude bit combinations of the fully 
mixed prompt integrand 31 and its corresponding values. 

Fully Mixed 
Prompt 

Integrand Sign 
31A 

Fully Mixed Prompt 
Integrand High 
Magnitude 31 B 

Fully Mixed 
Prompt 
Integrand L w 
Magnitude 31 C 

Fully Mixed 
Prompt 
Integrand 
Value 

0 

0 

0 

-1 

0 

0 

1 

-2 

0 

1 

0 

-3 


TABLE 6-continued 


Sign, high-magnitude, and low-magnitude bit combinations of the fully 
mixed prompt integrand 31 and its corresponding values. 

Fully Mixed 
Prompt 

Integrand Sign 
31A 

Fully Mixed Prompt 
Integrand High 
Magnitude 3 IB 

Fully Mixed 
Prompt 
Integrand L w 
Magnitude 31C 

Fully Mixed 
Prompt 
Integrand 
Value 

0 

1 

1 

-6 

1 

0 

0 

+1 

1 

0 

1 

+2 

1 

1 

0 

+3 

1 

1 

1 

+6 


Still continuing to refer to FIG. 3 A, the mixing of the 
early-minus-late PRN code 35 with the baseband mixed 
signals 23 forms fully mixed early-minus-late integrands 33. 
Fully mixed early-minus-late integrand sign 33A is an 
EXCLUSIVE OR between early-minus-late PRN code sign 
35A and baseband mixed sign 23A. Fully mixed early- 
minus-late integrand high magnitude 33B and fully mixed 
early-minus-late integrand low magnitude 33C are, as 
above, baseband mixed high magnitude 23B and baseband 
mixed low magnitude 23C. Fully mixed early-minus-late 
integrand zero mask 33D is early-minus-late PRN code zero 
mask 35B. The resulting representation is given in Table 7. 
As in Table 5, each X entry in the table indicates that the 
corresponding bit can be either zero or one without affecting 
the corresponding integrand value. 


TABLE 7 


Sign, high-magnitude, low- magnitude, and zero mask bit combinations 
of the fully mixed early- minus- late integrands 33 
and their corresponding values. 

Early-Minus- 
Late (EML) 
Integrand 
Sign 33 A 

EML 

Integrand 

High 

Magnitude 

33B 

EML 

Integrand 

Low 

Magnitude 

33C 

EML 
Integrand 
Zero Mask 
33D 

Early- 

Minus-Late 

Integrand 

Value 

X 

X 

X 

0 

0 

0 

0 

0 

1 

-2 

0 

0 

1 

1 

-4 

0 

1 

0 

1 

-6 

0 

1 

1 

1 

-12 

1 

0 

0 

1 

+2 

1 

0 

1 

1 

+4 

1 

1 

0 

1 

+6 

1 

1 

1 

1 

+12 


Referring now to FIGS. 3A, 3B, 4A, and 4B, the method 
for computing in-phase and quadrature accumulations for 
every accumulation period, for example every millisecond 
for GPS C/A code, by use of bit-wise parallelism includes 
the steps of selecting carrier replica signal 25 (FIG. 3A) 
according to the proximity of its frequency to the desired 
frequency, and representing sample signal data 21 (FIG. 3A) 
and carrier replica signal 25 (FIG. 3A) from at least one 
channel as bits in signal sign 21A(FIG. 3A) and, if present, 
signal magnitude 21B (FIG. 3A) and carrier replica sign 25A 
(FIG. 3A) and carrier replica magnitude 25B (FIG. 3A) 
(method step 101, FIG. 4A). Note that carrier replica signal 
25 (FIG. 3A) is chosen so that its frequency is close to the 
correct signal frequency. The method also includes the step 
of mixing signal data 21 (FIG. 3A) to baseband by comput- 
ing in-phase and quadrature baseband mixed sign 23A(FIG. 
3A) and in-phase and quadrature baseband mixed high and 
low magnitude 23B/C (FIG. 3A) (method step 103, FIG. 
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4A). The method further includes the steps of selecting PRN 
code from pre-computed over-sampled PRN code table 28 
(FIG. 3A) or of computing it using real-time over-sampled 
PRN code generator 30A (FIG. 3A), representing prompt 
PRN code 29 (FIG. 3A) as prompt PRN code sign 29 A (FIG. 5 
3A), and representing early-minus-late PRN code 35 (FIG. 
3A) from as early-minus-late PRN code sign 35A(FIG. 3 A) 
and early-minus-late PRN code zero mask 35B (FIG. 3A) 
(method step 105, FIG. 4A). The method further includes the 
step of de-spreading in-phase and quadrature baseband 10 
mixed signal 23 (FIG. 3A) by mixing it with prompt PRN 
code 29 (FIG. 3A) and early-minus-late PRN code 35 (FIG. 
3A), resulting in in-phase and quadrature fully mixed 
prompt integrands 31 (FIG. 3A), and fully mixed early- |S 
minus-late integrands 33 (FIG. 3A) (method step 107, FIG. 
4A). The method further includes the step of using prompt 
value word logic 27A (FIG. 3B) to compute prompt inte- 
grand value words 27 (FIG. 3B) from the in-phase and 
quadrature fully mixed prompt integrands 31 (FIG. 3A). The 20 
method further includes the step of using early-minus-late 
value word logic 37A(FIG. 3B) to compute early-minus-late 
integrand value words 37 (FIG. 3B) from the fully mixed 
early-minus-late integrands 33 (FIG. 3B) (method step 109, 
FIG. 4A). The method further includes the steps of summing 25 
over each prompt integrand value word 27 and early-minus- 
late integrand value word 37 (FIG. 3B) the number of one 
bits (or zero bits) using one bits summation table 38 (FIG. 
3B) or using a processor command if available (method step 
111, FIG. 4B), and summing, over the accumulation interval, .to 
the number of one bits (or zero bits) in each prompt 
integrand value word 27 and early-minus-late integrand 
value word 37 to produce prompt accumulations 41 (FIG. 
3B) and early-minus-late accumulations 49 (FIG. 3B) 
(method step 113, FIG. 4B). The method further includes the 35 
step of multiplying prompt accumulations 41 (FIG. 3B) and 
early-minus-late accumulations 49 (FIG. 3B) by correspond- 
ing values 41 A and summing the results over the value 
words of each signal for an entire accumulation interval to 
yield in-phase and quadrature summed prompt accumula- 40 
tions 45 (FIG. 3B) and summed early-minus-late accumu- 
lations 47 (FIG. 3B) (method step 115, FIG. 4B) that are 
stored for use by acquisition techniques or tracking loops. 
The method further includes the step of rotating the in-phase 
and quadrature summed prompt accumulations 45 (FIG. 3B) 45 
and summed early-minus-late accumulations 47 (FIG. 3B) 
(method step 117, FIG. 4B) to simulate a condition in which 
baseband mixing had been performed using cosine and sine 
signal replicas with the correct frequency and phase. If there 
are more channels to process (decision step 119, FIG. 4B), 50 
the method includes the step of repeating the previous steps 
beginning at method step 101, FIG. 4A. If there are no more 
channels to process (decision step 119, FIG. 4B), the method 
includes the step of setting parameters for the next accumu- 
lation period, including storing current C/A code phases, 55 
epoch counters, carrier phases, and carrier Doppler shifts 
(method step 121, FIG. 4B). If the time period to wait until 
the next accumulations need to be calculated has not expired 
(decision step 123, FIG. 4B), the method includes the step 
of sleeping until the expiration of the time period (method 60 
step 125, FIG. 4B). If the time period has expired (decision 
step 123, FIG. 4B), the method includes the step of repeating 
the previous steps beginning at method step 101, FIG. 4A. 
The length of the time period depends on the nominal 
accumulation period. It is set to be less than this period, 65 
normally between 50% to 90% of this period, to reduce the 
possibility that accumulations are missed for any channels. 


Referring again to FIGS. 3B and 4A, method step 109 
(FIG. 4A) calls for computing value words. This computa- 
tion starts by performing bit-wise parallel Boolean logic for 
each of the possible values in the right-hand column of the 
prompt integrand representation in Table 6. A 32-bit prompt 
integrand value word 27 (FIG. 3B) is computed for each 
thirty-two samples and each row of Table 6. The prompt 
integrand value word 27 (FIG. 3B) contains ones for the 
sample times when the actual integrand equals the corre- 
sponding value in the right-hand column of Table 6, and 
zeros for the remaining times when the actual integrand does 
not equal this value. The prompt integrand value words 27 
(FIG. 3B) corresponding to the possible Table 6 values are 
formed by method step 109 (FIG. 4A) as follows: 


MINUSONE=NOT(SIGN) AND [NOT(HIGHMAG) 

AND NOT(LOWMAG)] (13) 

MINUSTWO=NOT(SIGN) AND [NOT(HIGHMAG) 

AND LOWMAG] (14) 

MINUSTHREE=NOT(SIGN) AND [HIGHMAG 

AND NOT(LOWMAG)] (15) 

MINUSSIX=NOT(SIGN) AND [HIGHMAG AND 

LOWMAG] (16) 

PLUSONE=SIGN AND [NOT(HIGHMAG) AND 

NOT(LOWMAG)] (17) 

PLU STW 0=SIGN AND [NOT(HIGHMAG) AND 

LOWMAG] (18) 

PLUSTHREE=SIGN AND [HIGHMAG AND NOT 

(LOWMAG)] (19) 

PLUSSIX=SIGN AND [HIGHMAG AND 

LOWMAG] (20) 


Continuing to refer to FIGS. 3A, 3B, 4A, and 4B, method 
steps 109 (FIG. 4A), 111 (FIG. 4B), and 113 (FIG. 4B) call 
for operations for the fully mixed early-minus-late inte- 
grands 33 (FIG. 3A) that are similar to those for the fully 
mixed prompt integrands 31 (FIG. 3A). Early-minus-late 
integrand value words 37 (FIG. 3B) correspond to values 
that are double those of the prompt integrand value words 27 
(FIG. 3B), i.e., the MINUSSIX word becomes the 
MINUSTWELVE word. Also, an additional AND operation 
must be performed with the zero mask bits of Table 7 in 
order to mask out sample times when the early and late PRN 
codes cancel each other. Possible formulas for the method 
step 109 (FIG. 4A) computation of these early-minus-late 
integrand value words 37 (FIG. 3B) are as follows: 


MINUSTWO=[ZEROMASK AND NOT(SIGN)] 

AND [NOT(HIGHMAG) AND NOT 

(LOWMAG)] (21) 

MINUSFOUR=[ZEROMASK AND NOT(SIGN)] 

AND [NOT(HIGHMAG) AND LOWMAG] (22) 

MINUSSIX=[ZEROMASK AND NOT(SIGN)] AND 

[HIGHMAG AND NOT(LOWMAG)] (23) 

MINUSTWELVE=[ZEROMASK AND NOT(SIGN)] 

AND [HIGHMAG AND LOWMAG] (24) 

PLUSTWO=[ZEROMASK AND SIGN] AND [NOT 

(HIGHMAG) AND NOT(LOWMAG)] (25) 

PLUSFOUR= [ZERO MASK AND SIGN] AND [NOT 

(HIGHMAG) AND LOWMAG] (26) 



US 7,010,060 B2 


22 


21 

PLUSSIX= [ZEROMASK AND SIGN] AND 

[HIGHMAG AND NOT(LOWMAG)] (27) 

PLUSTWELVE= [ZEROMASK AND SIGN] AND 

[HIGHMAG AND LOWMAG] (28) 5 

Additional zero masking can occur in the first and last words 
of an accumulation interval. This is true because the start and 
stop times of an accumulation interval do not normally fall 
at the boundaries of data words. Therefore, the bits in the 10 
first word that precede the accumulation interval may need 
to get zero masked as might the bits in the last word that 
come after the end of the accumulation interval. 

Referring primarily to FIGS. 3B and 4B, the one bits 
counting operations of method step 111 (FIG. 4B) form the 45 
count of the number of one bits in each of the eight value 
words. If there are no such counting operations in the 
instruction set of microprocessor 16 (FIG. 1), the counting 
can be accomplished using a table look-up. In the case of a 
table look-up, prompt integrand value words 27 and early- 20 
minus-late integrand value words 37 (FIG. 3B) can be used 
as addresses in one bits summation table 38 (FIG. 3B), and 
one bits summation table 38 (FIG. 3B) can output the 
number of one values (or zeros) in the address. For example, 
if the table look-up operation is called BITSUM, the fol- 
lowing computations can be performed to compute one -bits 
counts: 

ONESCOUNT=BITSUM(VALUEWORD) (29) 

.50 

where the output of the table ONESCOUNT is the number 
of one bits in the word VALUEWORD. This operation is 
repeated for each of the prompt integrand value words 27 
(FIG. 3B) and early-minus-late integrand value words 37 
(FIG. 3B) in order to accomplish method step 111 (FIG. 4B). 35 
Selection of table width, for example 16-bit or 32-bit, 
depends on the amount of memory available and other 
design decisions. If the table width is smaller than the 
number of bits in a value word, then multiple calls of the 
table are used in order to sum up the total number of one 40 
values in a given value word. Each call takes as input only 
a portion of the bits in the value word. 

Continuing to refer primarily to FIGS. 3B and 4B, the 
accumulation operations of method steps 113 (FIG. 4B) and 
115 (FIG. 4B) sum the one bit counts for each prompt 
integrand value word 27 (FIG. 3B) and for each early- 
minus-late integrand value word 37 (FIG. 3B) over the entire 
accumulation interval, multiply each result by the value 41A 
(FIG. 3B) that is associated with the value word, and sum all 
of these scaled value accumulations to form the accumula- 
tions of equations (6) and (7), summed prompt accumulation 
45 (FIG. 3B) and summed early-minus-late accumulation 47 
(FIG. 3B). For example, the following computations can be 
performed to compute the in-phase summed prompt accu- 
mulation 45 in equation (6) as follows: 

N w (30) 

l gj k( 0 ) = ONESCOUm-l), p t - 

i=i 60 

N w 

2^ ONESCOUNT(- 2 ) lpl - 

i= 1 

N w 

3^ ONESCOUNT(-3), pl - 65 

1=1 


-continued 

N w 

6^ ONESCOUNT{-6), pl + 

t=i 

N w 

^ ONESCOUNT{+ l) lp[ + 

i= l 
N w 

2 V ONESCOUNT(+2) lpl + 
1=1 

N w 

3^ ONESCOUNT(+3) lpl + 
(=1 

N w 

6 V ONESCOUNT(+6), pl 
(=1 


where 1 is the index of successive bit-wise parallel data 
words in the accumulation interval, N M , is the total number 
of data words in the interval, and ONESCOUNT(k) 7/ ,, is the 
ones count for the corresponding value word 41 (FIG. 3B) 
associated with value k 41A (FIG. 3B) for the 1th data word 
interval and the in-phase summed prompt accumulation 45 
(FIG. 3B). The quadrature summed prompt accumulations 
45 (FIG. 3B) and the in-phase and quadrature summed 
early-minus-late accumulations 47 (FIG. 3B) are calculated 
in a similar manner. The only difference is in the actual 
ONESCOUNT values used and, for the case of early-minus- 
late signals, the set of k values 41A (FIG. 3B). 

Continuing to refer primarily to FIGS. 4A and 4B, the 
method of the present invention can be adapted to work with 
a different number of bits in the representation of the RF 
front-end output and of the baseband mixed signals. An 
increase above two bits can make the logic more complex 
and may decrease the time savings over straight integer 
arithmetic. A decrease to a 1-bit representation can have the 
opposite effect. For example, if the RF front-end uses 1-bit 
digitization rather than 2-bit digitization while carrier replica 
signal 25 (FIG. 2B) retains its 2-bit digitization, then the 
operation count can decrease by a factor of almost two for 
the 1-bit method, which can make the logic execute about 
4.2 times faster than straight integer arithmetic. 

Returning to the discussion of determining PRN code, and 
now referring again FIGS. 2C, 2D, and 3A, the real-time 
generation of bit-wise parallel over-sampled prompt PRN 
code sign 29A (FIGS. 2D and 3A), early-minus-late PRN 
code sign 35 A (FIGS. 2D and 3 A), and early-minus-late 
PRN code zero mask 35B (FIGS. 2D and 3A) can be carried 
out by real-time over-sampled PRN code generator 30A 
(FIG. 3A). The inputs to this calculation are the actual PRN 
code chip length 65 (FIG. 2D), At c , the sample interval 63 
(FIG. 2D), AU, the nominal early-to-late code delay 61 (FIG. 
2D), At_, m/ , the end time of the first code chip relative to the 
first sample time, or put another way, the time lag At 0 , 67 
(FIG. 2D) from the first RF sample time to the end time of 
the first prompt PRN code chip, and prompt code chips 91 
(FIGS. 2D and 6). The outputs are the three integers that 
store the prompt PRN code sign 29A (FIGS. 2D and 3A), 
early-minus-late PRN code zero mask 35B (FIGS. 2D and 
3A), and early-minus-late PRN code sign 35A (FIGS. 2D 
and 3A), which are all in bit-wise parallel format. 

Referring again to FIGS. 2C and 2D, table look-ups can 
be used to translate a PRN code and its timing information 
to bit-wise parallel representations of its over-sampled 
prompt and early-minus-late versions. The required table 
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look-ups can be simplified by recognizing that the following 
parameters are substantially constant, for the purposes of 
this calculation: sampling interval 63 (FIG. 2D), At,, the 
nominal chip length, At cnom , the early-minus-late code delay 
61 (FIG. 2D), At cnlZ , used by software correlator 19 (FIG. 1), 5 
and the maximum number of chips that span a data word of 
microprocessor 16 (FIG. 1). The difference between the 
actual chipping rate Af c (reciprocal of At c ) and the nominal 
chipping rate Af CJJom (reciprocal of At cnom ) that is used for 
the above simplification can be accommodated by correcting 10 
time lag 67 (FIG. 2D), At 0 , for the average effects of Doppler 
shift, a procedure discussed later. Using the simplification, 
each look-up table has two variable inputs: the actual set of 
prompt code chips 91 (FIGS. 2D and 6) and time lag 67 
(FIG. 2D), At 0 . A table look-up procedure for each signal 15 
component yields a single integer result for prompt PRN 
code sign 29A (FIGS. 2D and 3A), another single integer 
result for early-minus-late PRN code zero mask 35B (FIGS. 

2D and 3A), and yet another single integer result for 
early-minus-late PRN code sign 35A (FIGS. 2D and 3A). 20 

Time lag 67 (FIG. 2D), At 0 , can take on an infinite number 
of values in the continuous range: 




24 

direction of -oo. This k m ,-„ value can cause the minimum At 0< . 
to fall slightly below the lower limit in equation (31), which 
can cause memory inefficiency, but this value is advanta- 
geous because it may simplify some further computations. 

The size for each table can be a function of the maximum 
number of code chips that may fall within a data word’s 
sample range. Given At 0 , bit information for the following 
number of code chips is required in order for the prompt 
PRN code 29 (FIG. 2D), early code 69B (FIG. 2D), and late 
code 69C (FIG. 2D) to be fully specified at all of the data 
word’s sample times: 

! 1 1 (34) 

[h s - l]Ar, - A/ 0 + - A r m , 

/(A/o) = flood = 1 + 2 


where n, is the number of data samples that can be stored in 
bit-wise parallel format in each word. It is clear from 
equation (34) that l(At 0 ) is a non-increasing function of At 0 . 
Therefore, the maximum number of required chips occurs at 
the minimum value of At 0 : 

L=m<Xmin) (35) 


This range’s lower limit guarantees that the end time of the 
first late chip occurs no earlier than the first sample time. A 
lower time lag 67 (FIG. 2D) At 0 value would make the first 
chip irrelevant to the prompt PRN code 29 (FIG. 2D), early 
code 69B (FIG. 2D), and late code 69C (FIG. 2D) at all of 
the sample times. The upper limit in equation (31) guaran- 
tees that the start time of the first late chip occurs no later 
than the first sample. A larger value of At 0 would leave the 
late code 69C (FIG. 2D) at the first sample time undefined 
based on the available code chips. 

Referring now to FIG. 5, to create an electronically 
processable table, the continuous range of At 0 values can be 
replaced with a discrete grid having m equally spaced points 
per sample interval 63 (FIG. 2D), At,. The integer m is 
chosen to be large enough so that the granularity At,/m gives 
sufficient PRN code timing resolution. In GPS applications 
m is usually chosen to be large enough so that (cAt^m) is on 
the order of several meters or less, where c is the speed of 
light, but reasonably sized because the table sizes are usually 
proportional to m. Given a choice of m, the grid of relative 
end times of the first prompt code period is: 


&tok = for k = k m ^ , . . . , k max 

m 


where the limits 


The size of each table can be determined from the param- 
eters k„„,„ k, m „, and L. The grid contains k M =(k m „-k ml „+l) 
different time offsets of the first code chip. At each of these 
grid points there are 2 L possible combinations of the code 
chips. Thus, each table optimally contains k ror x2 i entries, 
and each entry is optimally an unsigned integer in the range 
from 0 to 2"'-l. 

Continuing to refer to FIG. 5, each table can be stored as 
an array with a single index. The first 2 L entries correspond 
to the 2 l different possible chip sequences that can occur at 
At 0 =At 0A . the next 2 L entries correspond to At 0 = 
Ato(fom>i+i)> and so forth. The tabulated bit sequences for a 
fixed At 0 are ordered by interpreting the sequence as a binary 
index counter with the first chip being the most significant 
counter bit and the L lh chip being the least significant bit. 
The integer elements of the table can be the x(i) table 
elements 81 with corresponding code time offset 83 At oto 
and corresponding bit sequence 85 of the chips. The array 
index of a given x(i) table element 81 can be computed 
based on its code time offset 83 At 0Ar grid index k and its 
corresponding bit sequence 85. The corresponding bit 
sequence 85 associated with the array index consists of the 
chip values C(l), C(2), C(3), . . . C(L). The C(j) chip values 
are either zero or one, with zero representing a -1 PRN code 
value and one representing a +1 PRN code value, and they 
are listed in order of increasing time. The corresponding 
array index of the x(i) table element 81 is: 


.50 


35 


40 


45 


50 


55 


kmin 


floor! 


Ytlbdeml \ n 

2A t s / 


(33a) 

60 


l (36) 

i[k, C( 1), C(2), C(3), , C{L)\ = 1 + {k-k^x 2 L + ^ C(j)2 L ~ J 

7=1 


k max = flood 


m\At c - - Ar^J 

~KF S 


(33b) 


65 

provide full coverage of the interval defined in equation 
(31). The floor( ) function rounds to the nearest integer in the 


This equation can be inverted to give the code time offset 83 
grid index k and the corresponding bit sequence 85 as 
functions of the x(i) table element 81 index i: 

k(i)=k min +f[.oox[{i-l)l2 L ~\ (37a) 
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C(j, i) = mo. 




for j= 1,2,3, 


(37b) 


where mod(y,z)=y-zxfloor(y/z) is the usual remainder func- 
tion. 

Continuing to refer to FIG. 5, the following computations 
generate the x(i) table elements 81 entries of the three tables. 
Given i, the corresponding code time offset 83 grid index 
k(i) is computed from equation (37a) and is used to generate 
three sequences of chip indices: 


IS 


-“](£]> - 

= 1,2, 3, ... 

(38a) 

, Us 

m J L J + 1. 2A t c \) 

for n = 1 , 

(38b) 


2, 3, .. 

.. ,n s 

*(01r i rAf OT( -i, 
m JUrJ L 2A rj) 

for n = 1, 

(38c) 


2, 3, .. 

.. ,n s 
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where n is the index of the sample time within the over- 
sampled data word. The integer j p (n,i) is the index of the 
PRN code chip that applies at sample n for the prompt PRN so 
code 29 (FIG. 2D). The integers j e (n,i) and j ; (n,i) are defined 
similarly for the early code 69B (FIG. 2D) and late code 69C 
(FIG. 2D), respectively. The formulas in equations (38a)- 
(38c) amount to time measurements of each sample given in 
units of chip lengths past the first chip. These indices, in ss 
turn, can be used to determine the chip values that apply at 
the sample times: 


Cp(n,t)=C[j p (n,t)-,C\ tor n= 1, 2, 3, . . 

• , n 5 

(39a) 

C 0 (n,i)=C[j s (n,i);C\ for n= 1, 2, 3, . . 

• , n s 

(39b) 

C/(n,i)=C[jfat);i] for n= 1, 2, 3, . . 

• > n s 

(29c) 


where C p (n,i) is the over-sampled prompt PRN code 29 
(FIG. 2D), and C e (n,i) and Q(n,i) are, respectively, the early 45 
code 69B (FIG. 2D) and late code 69C (FIG. 2D). Each of 
these code bit values is either zero or one, as dictated by the 
outer mod(,2) operation in equation (37b). These over- 
sampled chip values can, in turn, be used to formulate 
tabulated functions x p (i), x emfam (i), and x em/2s ( i) that gener- 50 
ate the unsigned integers that constitute the bit-wise parallel 
code representations of the three tables: 

2% (40a) 55 

VO = X C p( n - 0x2" s ~" 

n=l 

A (40b) 

=2j mod)[C t (n, i) + C,(n, 0], 2)x2" s " 

" =1 60 

A _ l 4011 ) 

■tali) = mod {[C„(n, i) + C,(n. 0], 2|xC,(«, ;)x2" s " 


where x^(i) is the entry of the prompt sign table, x £ , m/ , m ( i) is 
the entry of the early-minus-late zero mask table, and 
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x emi2s0) is lh e entry of the early-minus-late 2’s sign table. 
Note that the formula used in equation (40c) is only an 
example illustrative embodiment of the early-minus-late 2’s 
sign table calculation. It places zeros in all of the X entries 
of early-minus-late PRN code sign 35A (FIG. 2D). There 
exist alternate formulas that are equally correct but that do 
not place zeros in the X entries. 

The table layout in FIG. 5 is only an illustrative embodi- 
ment of how one can construct a table that can be used to 
translate PRN code chip values and timing information into 
data words that store the bit-wise parallel representations of 
the over-sampled prompt PRN code sign 29A (FIG. 2D), 
early-minus-late PRN code zero mask 35B (FIG. 2D), and 
early-minus-late PRN code sign 35A(FIG. 2D). Other table 
layouts are also possible. Possible illustrative index calcu- 
lations are described below for indexing into the tables for 
PRN code retrieval during accumulation calculations If 
another table layout is used, then different indexing calcu- 
lations might be needed. Furthermore, different indexing 
calculations can be used even for the illustrative table layout 
shown in FIG. 5. 

Referring now primarily to FIG. 6, accumulation calcu- 
lations, as have been previously outlined herein and else- 
where, work with a fixed sequence of code chips. The 
prompt version of this sequence has a specified timing 
relationship to the incoming RF signal data 21 (FIG. 2A). 
This relationship can be pre-determined by a code search 
algorithm if software receiver 10 (FIG. 1) is in acquisition 
mode or by its delay-locked loop if it is in tracking mode. 
Software correlator 19 (FIG. 1) can calculate an accumula- 
tion using prompt code chips 91 (FIG. 6) C(l) through 
C(M). The timing of the prompt replicas of prompt code 
chips 91 (FIG. 6) can define the accumulation interval. The 
chip sequence starts at start lag 93 (FIG. 6) At Ito „ seconds 
past the first sample of data word W, 95 (FIG. 6), it chips at 
the constant chipping rate f c =l/At c , and it ends at end time 
97 (FIG. 6), which occurs At iM „+MAt c seconds after the first 
sample of data word Wj 95 (FIG. 6). The end of the M' h 
prompt code chip can occur during data word 99 (FIG. 
6), which implies that 


I = cei l( 


A t s1a „ + MAr c , 
A/, / 


(41) 


where the ceil( ) function rounds to the nearest integer 
towards +o°. Some of the initial bits of data word Wj 95 
(FIG. 6) and some of the final bits of data word 99 (FIG. 
6) may not be included in the accumulation. Let n ex0 be the 
number of initial bits of data word Wj 95 (FIG. 6) that are 
excluded, and let n e ^be the number of final bits of data word 
W N 99 (FIG. 6) that are excluded. The timing relationship in 
FIG. 6 implies that these numbers are: 



(42a) 


n rx j = n s N - ceil| 


. /Ar sta „ + MAr c , 


A r. 


(42b) 


These sample counts can be used to develop additional 
zero mask words that software correlator 19 (FIG. 1) uses to 
properly process the first and last data words during its 
bit-wise parallel accumulation calculations, as defined in A 
12-Channel Real-Time GPS LI Software Receiver, B. M. 
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Ledvina et al.. Proceedings of the ION National Technical 
Meeting, Jan. 22-24, 2003, Anaheim, Calif, and Bit-Wise 
Parallel Algorithms for Efficient Software Correlation 
Applied to a GPS Software Receiver, B. M. Ledvina et al., 
to appear in the IEEE Transactions on Wireless Communi- 
cations, 2003, both incorporated herein in their entirety by 
reference. Note that equations (41)-(42b) and all related 
timing considerations herein use the following code chip 
start/stop convention: a sample is correlated with a particular 
code chip if the start time of the code chip coincides exactly 
with the sample time, but it will not get correlated with that 
chip if its sample time coincides exactly with the end time 
of the code chip. 

Continuing to refer to FIG. 6, efficiently determining the 
correct x^(i), x emlzm (i), and x emC ,(i) bit-wise parallel code 
representations for the N data words 95 (FIG. 6) through 
data word 99 (FIG. 6) involves making an efficient 
determination of the correct table index y, that corresponds 
to data word W v for v=l, . . . , N, where the table index y 
is a function of start lag At, ra , r 93 (FIG. 6), actual PRN code 
chip length At, 65 (FIG. 6), v, and prompt code chips C(0), 
C(l), C(2), . . . , C(M+1) 91 (FIG. 6). The chip value C(0) 
94 (FIG. 6) is needed in order to specify the late code 69C 
(FIG. 2D) at the initial few samples of the accumulation, and 
the chip value C(M+1) 96 (FIG. 6) is needed to specify the 
early code 69B (FIG. 2D) at the final few samples. Addi- 
tional constants that can be used in order to determine the 4 
indices are At,, n,, m, L, k mj>1 , k max , and nominal chip length 
At cnom , which has been used to generate the three x(i) tables. 

The first step of the index calculation procedure pre- 
computes and stores a table of candidate integers for the final 
summation term that appears on the right-hand side of 
equation (36). This table takes the form: 


L (43) 

A/( J u) = 2 J C(ft + j-L- 1)2 l ~ j for ^ = 1, 2, 3 (M+L+ 1) 

j=i 


This computation requires the undefined chip values 
C(-L+l), C(-L+2), C(-L+3), . . . , C(-l), and C(M+2), 
C(M+3), C(M+4), . . . , C(M+L). The value zero can be used 
for each of these undefined chips because they can affect the 
over-sampled codes only for the first n ex0 samples of data 
word Wj 95 (FIG. 6) or for the last n e ^samples of data word 
Wjy 99 (FIG. 6), none of which are part of the accumulation. 
The table of equation (43) can be constructed by using the 
following iterative procedure: 

Al'(l)=C(0) (44a) 

Ai(jti)=mod[2Ai(/i-T),2 L ]+C(/i-l) for //=2, 3, 4, ... , 

(M+2) (44b) 

Af(«)=mod[ 2Ai(u-l),2 L ] for //=(A/+3). (M+4), . . . , 

(M+L+l) (44c) 

Note that the mod(2x,2 i ) operation in the latter two equa- 
tions can be replaced by a single truncated leftward bit shift. 

In many cases prompt code chips 91 (FIG. 6) C(0), C(l), 
C(2), . . . can be generated as the output of a feedback shift 
register or a system of such registers. For example, the new 
GPS civilian L2 signals can be generated this way. In this 
case, each iteration of equation (44b) can be interleaved with 
an iteration of the shift register calculations. Shift-register 
generation of PRN codes is well-known in the art. 

An alternative to building up the previously-described 
table is to calculate the index component only for one data 
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word at a time. Suppose that Ay is the correct index 
component for data word W v , and that u v is the auxiliary 
index that would have been used to determine Ay from the 
Ai ( m) table had the table existed. In order to calculate Ai v+1 
5 for data word W v+1 , y v+1 is computed (procedure defined 
herein), feedback shift register calculations that generate 
C(f< 84 ), C(/y+ 1), C(/y+ 2), . . . , C(ty +1 - 1) are iterated, and 
the resulting chip values are used to perform (/y +1 -/y) 
iterations of equations (44b) or (44c). 

10 Determination of the correct index into the x (i), x emfam (i), 
and x eml2s (i) tables for data word W v can be reduced to the 
determination of two quantities. One is the time offset index 
ly that causes At 0ltv from equation (32) to match the true time 
offset for data word W v as closely as possible. The other 
15 quantity is the auxiliary table index /y. It constitutes an 
index for the sequence of actual code chips that are associ- 
ated with data word W v . Given these two quantities, the 
correct index for the three x(i) tables is 

£ v =l+(A v -£ m Ax2 L +Ai(it v ) for v=l, 2, 3 . . . , N (45) 

20 

The auxiliary index /y is determined by the position of the 
W v data word relative to the PRN code chip sequence. Once 
that position has been ascertained, the index ly can be 
calculated from the position relative to the W v samples of 
25 the L code chips that are associated with the index /y. 

A time integer can keep track of the number of fine-scale 
time units in a given interval. The fine-scale time unit is a 
small fraction of the sample interval 63 (FIG. 2D), At,: 

30 


35 where rry is the integer number of fine-scale time intervals 
per sample interval 63 (FIG. 2D), At,. This number is chosen 
large enough, for example ny>2mN, to preclude any sig- 
nificant build-up of timing errors during an accumulation 
interval due to the finite time resolution Ay N is the number 
40 of data words in the accumulation interval. The calculation 
of the ly values over one accumulation interval involves 
approximately N iterative time increments, each of which 
has a resolution of Ay. If ny obeys the inequality given 
above, then the cumulative timing errors due to the finite 
45 precision Ay will be less than the timing error caused by the 
finite timing precision of the x(i) tables. Normally it is 
possible to make ny much larger than 2mN and still keep all 
of the relevant calculations within the size limits of a 32-bit 
signed integer. If nyis a power of two, a rightward bit shift 
50 operation can be used to implement integer division by ny. 
Time unit Ay can be used to define an integer that approxi- 
mately keeps track of the code/sample time offset At 0v for 
data word W v : 

55 

(Ar 0v ) (47) 

Kf v = round = round 

1 (A tj) V A r, J 


60 where the round) ) function rounds up or down to the nearest 
integer. The time lag 67 (FIG. 2D), At 0v , is the amount by 
which the end time of PRN code chip C(ty-L) lags the first 
sample time of data word W v . The algorithm that iteratively 
determines ly, tries to keep the relationship in equation (47) 
65 exact, but using only integer operations can allow small 
errors to build up. Note that k^/nyely/m, as implied by a 
comparison of equations (32) and (47). This relationship can 
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be used to determine k^, from an iteratively determined k^,. 
Several constants are required by the iterative procedure that 
determines V k^,, and /.i v . The first five constants are used 
to account for the difference between the nominal chip 
length At cnojn , used to generate the x(i) tables, and the actual 5 
chip length 65 (FIG. 2D), At c used in the accumulation: 


k fm id=^und[(n s -l)m/2] 


(48a) 

Af c — A tcnom 
A= Ar c 


(48b) 

[7 mfk m in 

= ceil — — 

|a 2 J sign(A) 

(48c) 

bfx ~ 1 2"' , M°jm>A)/ 1 °s( 2 >] 

if A t c = A t cmm 
if A t c # Af cra , 

(48d) 

«A<= round ( X V) 


(48e) 


where the sign( ) function returns +1 if its input argument is 
positive, zero if the argument is zero, and -1 if the argument 
is negative. The index kf mid is approximately half the length 
of a data word as measured in units of Aty seconds. During 
an accumulation, the rational factor a^/b^ v gets multiplied 
by the time offset between the end time of the first code chip 
and the midpoint of the data word. The result is a time .50 
perturbation that removes the average effect of the difference 
between the actual and nominal PRN code chipping rates. 
The time perturbation can be used to compute a corrected V 
value: 


[ a r, x I (49) 

kfifix(kjv) = k f v + round - k fi )— j 

40 

Equation (48d) picks b^ Y to equal a power of two so that 
the integer division by in equation (49) can be accom- 
plished using a rightward bit shift operation. The round) ) 
operation in equation (49) can be accomplished as part of the 
division if one first adds sign(a / j T )xb /iT /2 to the quantity 45 
(WV) xa^ before performing the rightward bit shift that 
constitutes division by b^ v . This approach can give the 
correct k^ T because the signs of ( k/„„ v/ -k /v ) and b^ v are both 
positive and because the rightward bit shift has the effect of 
rounding the signed division result towards zero. An alter- 50 
nate implementation of the round function could be used for 
applications that do not guarantee k^,„ w >k^,. Such applica- 
tions are normally associated with L=2 PRN code chips per 
data word. SJ 

Five additional constants can be used to define the k^, and 
ft v iterations: 


-continued 



The constant L is the typical number of code chips per 
data word. It is the nominal increment to /( v per data word. 
The constant Ak^. equals the number of fine-scale time 
intervals per PRN code chip. The constant Ak*. is used to 
adjust k^, up or down if kyv falls outside of the limits: 
kp, lin = k^, = k^, av . The constant Ak^ is the nominal incre- 
ment to per data word. The limits ky,„„ and k max are 
approximately the limits k min and k max from equations (33a) 
and (33b) re -scaled to the new fine time scale and adjusted 
for the difference between the nominal code chipping rate of 
the x(i) tables and the actual chipping rate of the accumu- 
lation. The extra -2 term on the right-hand side of equation 
(33a) is compensated for by the increment to k„„„ on the 
right-hand side of equation (50d) and the decrement to k max 
on the right-hand side of equation (50e). The original -2 
term and the increment and decrement have been included 
because they ensure that ky values which respect the limits in 
equation (50d) and (50e) are transformed into k values that 
respect the limits in equations (33a) and (33b). 

The iteration begins by initializing k^ and for the first 
data word. The nominal initial values are: 


kflnom = round. 


/'I nom = floor I 


I At s tart \ (~&hra 

hr) +1+floor (^- 


Ar^mn'l 

Ar s Jj 


\t c 


+ 1+L 


(51a) 

(51b) 


It is possible that k flnom from equation (51a) can violate its 
upper limit k^^. Therefore, the following conditional 
adjustment can be implemented in order to finish the ini- 
tialization. 



1 kf lnom if kf lnom < kf max 

(52a) 

k n = 

\ kflnom - A£/c if kf mgx < kflnom 


H 

' Pi nom if kflnom ^ */max 

, Pi nom - 1 if k /max < kflnom 

(52b) 


Lty P = round | 


(50a) 

v Ar c ) 

Akf c = round 

fnifkt c \ 

{ Ar 5 ) 

(50b) 

M fiyp = rounc 

[ m fLtypbl c 'j 

(50c) 

l Ar, ) n ‘” ,f 



Given this initialization, the calculation of (k y,„/< 2 ), (k /3 ^r 3 ), 
(ky 4 ,« 4 ), . . . , (k py/v) proceeds according to the following 
iteration: 

kfs.’„o,„=kfi v -i)+kkf, vp for v=2, 3, 4, . . . , N (53a) 


Mv„om=/4y-l)+ L ,yp for V=2, 3, 4, . . . , N 


(53b) 
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{ kjvnom + Akf c if kfvnom < k fmin 

kjvnom if kfinin — k fvnom — ^/max for V = 2, 3, 4, . . . , N 

kjvnom AA/ C if kjinax < kj vnom 

! f^vnom "^"1 if kjvnom ^ kj m in 

ftvnom if kjfnin S kjvnom — kf max for V = 2, 3, 4, . . . , N 

ftvnom 1 if kjynox < kjvnom 

The table look-up calculations finish with the computation 
of ky, i v , and the actual table look-ups: 

k v = round f m ^ ( ^ v) 1 for v = 1, 2, 3, ... , N 

[ m f \ 

The round( ) operation in equation (55) can be imple- 
mented by adding my2 to mxky^k*,) before the rightward 20 
bit shift that constitutes division by nu The result of the 
division will be the correct value of lq, for any sign of 
if the computer works with 2’s compliment nota- 
tion for signed integers and if the rightward bit shift fills in 
from the left with the 2’s compliment sign bit, i.e., with the 25 
left-most bit. 

Given k^, from equation (55) and /u v from equation (54b), 
one can use equation (45) to compute i v . This value, in turn, 
can be used to index into the tables to determine the Prompt 
PRN code sign 29A (FIGS. 2D and 3A), x^, the early- .50 
minus-late PRN code zero mask 35B (FIGS. 2D and 3A), 
x emizmv> and the early-minus-late PRN code sign 35A(FIGS. 

2D and 3A), x eml2sv , that correspond to data word W v : 


Xpy=x p (iy) for v=l, 2, 3 .... N 

(56a) 

for v=l, 2, 3 N 

(56b) 

O V=l, 2, 3 .... N 

(56c) 


The conditionals in equations (54a) and (54b) can be 
reduced to a single conditional per data word during normal 40 
operation to improve efficiency. This can be done because 
the sign of Ak^ in equation (53a) is fixed for a given 
accumulation interval. (Normally the sign of Ak ^ does not 
vary from accumulation interval to accumulation interval or 
from channel to channel for a given receiver because the 45 
only variable quantity that affects Ak ^ is actual chip length 
65 (FIG. 2D), At c , which normally does not vary signifi- 
cantly.) If Ak^ xp <0, then the proper formula for determining 
V and /< v can be chosen by considering the inequality 

<kf mi „. Conversely, if Ak^s-O, then the proper for- 50 
mula can be determined by considering the inequality 
kfvnom > kf max - The decision about which condition to check 
can be made at the beginning of the accumulation because 
Aky^.p is calculated prior to execution of the iteration in 
equations (53a)-(56c). 55 

When using a processor that creates instruction pipelines, 
"if” statements can disrupt the pipeline. In this case equa- 
tions (54a) and (54b) can be replaced with the following 
computations: 


min[0, sign kfvnom Gm in )1 s. 0 

0 = 0 for v = 2, 3,4 N 

max[0, sign (, k f , mm - )] M fyp > 0 


(54a) 


(54b) 


* yv =iy,„ om - T lyvW ft . for v=2, 3, 4, .... At (57b) 

t i v=l l v n a m - r \fy for v =2, 3, 4 , ,N (57c) 

The min( ) and max( ) functions return, respectively, the 
minimum or maximum of their two input arguments. The 
variable r| /v is normally zero, in which case equations (57b) 
and (57c) leave k^ equal to k^ TOm and/< v equal to fi vnom . The 
value of n /v is -1 if Ak /h , ; ,<0 and k /v „ om <k >I1 „, and +1 if 
Ak ftyp >0 and In both of these cases causes 

equation (57b) and (57c) to perform the necessary adjust- 
ments to k^ and /i v . Note that efficient code may not execute 
the conditional in equation (57a) once per data word. 
Instead, its accumulation iterations could be performed in 
one of three different iterative loops, depending on the value 
of Akfc, . Additional economies can be had in the first and 
third conditional clauses of equation (57a). The value of 
~f\f v for the first condition is equal to the sign bit of the 2’s 
compliment representation of k /v „ om -k >Ii>1 . Similarly, +ry v 
for the third condition is equal to the sign bit of the 2’s 
compliment representation of ky^-k^,,,,. In either case, 
ry v (or its negative) can be computed in two operations. 

Summarizing real-time over-sampled PRN code genera- 
tor 30A (FIG. 3A) and referring now to FIG. 7, to compute 
prompt PRN code 29 (FIG. 3A) and early-minus-late PRN 
code 35 (FIG. 3 A) for an entire accumulation interval, the 
method includes the steps of iterating equations (44a)-(44c) 
(method step 201, FIG. 7) to construct the table of Ai(/<) 
values. The method further includes the step of computing 
the auxiliary constants (method step 203, FIG. 7) in equa- 
tions (48a)-(48e) and (50a)-(50e). The method further 
includes the step of initializing k^ and fi 1 (method step 205, 
FIG. 7) by evaluating equations (51a)-(52b). The method 
further includes the step of iterating equations (53a), (53b), 
(57a)-(57c), (49), (55), and (45) (method step 207, FIG. 7) 
to compute, for each iteration, fi vnom , r^, k^, fi v , k^, 

and y. The method further includes the step of iterating 
equations (56a)-(56c) (method step 209, FIG. 7) to com- 
pute, for each iteration, x^, x emlzmv , and x eml2pv . 

As mentioned already, it may prove efficient to interleave 
the equations (44a-c) iterations and the accompanying shift 
register iterations between the iterations that compute V- 
through x eml2sw . In this scenario can be computed from 
equation (57c). Afterwards, the shift register iterations that 
generate code chips C(m v _ 1 -1) though C(« v -1) can be 
performed, and these chip values can be used to iterate 
equations (44a-c) from l u v _ 1 to in order to determine 
Ai (// v ) from Ai(zr v _ , ). 


(57a) 


Ifi = 
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The software correlator 19 (FIG. 1) of the present inven- 
tion can advantageously be easily modified to work with 
signals at different frequencies, new PRN codes, or even 
signals for different types of devices. Thus, the same hard- 
ware could use the software correlator 19 (FIG. 1) to 5 
implement such devices as a GPS receiver, a cell phone, or 
both. To allow for new codes, new frequencies, and new 
types of functionality, small changes can be made in the 
software correlator 19 (FIG. 1), or different versions of the 
software correlator 19 (FIG. 1) can be run on the same to 
processor. The changes involve using a different baseband 
mixing frequency and a different PRN code in the correla- 
tion, and perhaps changes that would provide the new 
signals of interest to the software correlator 19 (FIG. 1). In 
order for the present invention to work with signals at is 
different frequencies, new PRN codes, or signals for differ- 
ent devices, two fundamental changes need to be made. 
First, the baseband mixing frequency must be tailored to that 
of the signal data 21, which also involves pre-computing and 
storing sine and cosine tables at this new frequency. Second, 20 
new pre-computed over-sampled PRN code tables 28 (FIG. 
3A) must be constructed. The size of the new tables should 
match the over-sampled accumulation period, or at least one 
over-sampled period of the PRN code. As an alternative to 
generating new pre-computed over-sampled PRN code 25 
tables 28 (FIG. 3A), the new PRN codes can be generated in 
real-time by over-sampled PRN code generator 30A (FIG. 
3A). Also, the system and method of the present invention 
could be implemented within systems such as GLONASS 
receivers, cell phones and cell base stations, pagers, wireless .to 
Ethernet (e.g. 802. llx standards), Bluetooth™, Black- 
berry® wireless internet devices, and satellite radio/phones 
(e.g. INMARSAT®). In fact, the system and method of the 
present invention are applicable to any sort of telecommu- 
nication system/device that uses spread spectrum, code 35 
division multiple access (CDMA) pseudo random number 
codes for the transmission of information, either wired or 
wireless. 

Referring now to FIG. 8, navigation calculations require 
measured values of the PRN code phase 55, carrier phase, 40 
and carrier frequency. The measurements for all tracked 
satellites must be taken at exactly the same time. A time 
interval counter (TIC) function provides a periodic timing 
scheme to synchronize these measurements at time t r/c 56. 

At time i TIC 56, the TIC function latches all of the PRN code 45 
phase 55, carrier phases, and carrier frequencies along with 
the code epoch counters, and software correlator 19 (FIG. 1) 
makes these available to application-specific code 15 (FIG. 

1), for example, GPS receiver software. GPS receiver soft- 
ware uses the code phase and epoch counters to compute the 50 
pseudo range to each satellite. Software correlator 19 (FIG. 

1) keeps track of the code and carrier phase of each signal 
as determined by the code chipping rate and the carrier 
Doppler shift inputs. The quantity l cjk , the estimated code 
chipping rate of software receiver 10 for satellite j during its 55 
k r, ‘ PRN code period, can be determined either by an 
acquisition search procedure, or if tracking, by a delay- 


locked loop. Likewise, u> Doppjk , the associated carrier Dop- 
pler shift, can be defined by an acquisition procedure or, if 
tracking has commenced, by a phase-locked loop or a 
frequency-locked loop. These determinations are made by 
application-specific code 15 (FIG. 1). Software correlator 19 
(FIG. 1) can use these two frequencies to update quantities 
that keep track of its code and carrier phases according to the 
formulas: 


1023 (58) 

T jk l l - ' T jk + — 

f cjk 

U 1 1 i( T /*i-i (59) 

Software correlator 19 (FIG. 1) can keep a running track 
of these quantities and can initialize these iterations as part 
of the signal acquisition calculations that it carries out in 
conjunction with application-specific code 15 (FIG. 1). The 
quantities t j0 and <j> i0 are either sent to software correlator 19 
by application-specific code 15, or they are initialized arbi- 
trarily by software correlator 19 and application-specific 
code 15 executes feedback control of f - k and (x) Doppjk to force 
the sequences defined by equations (58) and (59) to con- 
verge to appropriate values. Information about the previ- 
ously-described conventional method can be found in Dier- 
endonck. 

The TIC time \ TIC 56 (FIG. 8) can occur at, for example, 
the millisecond boundaries of the receiver clock. At each 
time l TIC 56, the PRN code phase 55 (FIG. 8) of each signal 
is computed in the following manner: 


'P jnc - 1 0 2 3 


tnc - ? jk+l 

Tjk+2 -Tjk+l 


(60) 


where rj ) ]TIC is the PRN code phase 55 (FIG. 8) in chips of 
signal j at TIC time t T,c 56 (FIG. 8). The epoch counters, 
which are simply a running total of the number of code 
periods 57 (FIG. 8), are incremented at each code start/stop 
time. 

The carrier phase calculation at time t TIC 56 (FIG. 8) is 
similar to the PRN code phase 55 (FIG. 8) calculation: 

L IK = ty I l +ta Doppjk+l(t T I<r~'tjk+ 1) (61) 

where (|i ///c is the carrier phase at time t TIC 56. The Doppler 
shift that gets returned at time t TIC 56 (FIG. 8) is to Dopp]k+1 . 

With respect to the performance of the system and method 
of the present invention, a sample screen-shot from the 
illustrative embodiment of the present invention is provided 
in Table 8. This table illustratively shows the tracking of 
nine channels. The roof-mounted LI antenna of the illus- 
trative embodiment can have a pre-amp with 26 dB of gain. 
The software correlator 19 (FIG. 1) of the present invention 
can provide positional accuracy on the order of 10-15 
meters when working in conjunction with application spe- 
cific software 15 (FIG. 1). 


TABLE 8 


Numerical GPS tracking data output by the illustrative embodiment. 


Lat 

42.44354 

Spd 

0.5 

SVs 

8 

Ctrack 

FLL 

Date 

17/10/02 

Lon 

-76.48143 

Hdg 

327.0 

Nav 

3D 

GDOP 

1.9 

GPS 

19:58:11 

Alt 

269.6560 

ROC 

-0.7 

HI 


DO 

-393.0 

OscErr 

0.25 


ELEV 
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TABLE 8-continued 




Numerical GPS tracking data output by the illustrative embodiment. 

CH 

sv 

ELY ATI DOPP NCO UERE SF PRerr PRRerr LOCK SNR iS4 


1 

1 

67 

237 

-528 

-927 

4 

1 

9.0 

0.7 

CCBF 

16.9 

- 1.000 

2 

22 

64 

47 

-1644 

-2045 

4 

1 

14.3 

0.6 

CCBF 

18.0 

- 1.000 

3 

3 

50 

152 

2174 

1778 

2 

1 

5.5 

-0.4 

CCBF 

18.4 

- 1.000 

4 

25 

22 

106 

-2650 

-3050 

2 

1 

42.9 

0.2 

CCBF 

13.2 

- 1.000 

5 

17 

1 

62 

1722 

1331 

2 

1 

7.5 

-0.4 

CCBF 

8.1 

- 1.000 

6 

15 

2 

81 

2278 

1887 

2 

1 

-5.2 

-0.7 

CCBF 

7.4 

- 1.000 

7 

27 

12 

295 

2969 

2575 

0 

1 

0.0 

0.0 

CCBF 

8.0 

- 1.000 

8 

13 

44 

303 

1856 

1866 

2 

0 

0.0 

0.0 

C 

14.9 

- 1.000 

9 

10 

31 

22 

185 

3860 

3464 

2 

1 

-11.0 

-0.5 

CCBF 

15.9 

- 1.000 

11 

12 

20 

4 

219 

-3086 

-3483 

2 

1 

27.5 

0.3 

CCBF 

9.6 

- 1.000 


Two comparison tests illustrate the performance of the 
system and method of the present invention. In the first test, 
a first configuration includes a MITEL® GP2021 hardware 
correlator, but is in all other ways identical to a second 
configuration that includes the software correlator 19 (FIG. 
1) of the present invention. The two configurations differ in 
SNR by less that 1 dB and in navigation solutions by no 
more than 5-10 meters. In the second test, timing studies 
using the system of the present invention show that pro- 
cessing six channels uses only about 20% of the processor’s 
capacity, while Akos 2001a report a real-time software GPS 
receiver that would require 100% of the capacity a 1.73 GHz 
microprocessor to implement a 6-channel GPS receiver 
when processing data from an RF front-end with a sampling 
frequency of 5.714 MHz. 

Referring now to FIGS. 9A and 9B, among other indica- 
tors that could assess the accuracy of the PRN code gener- 
ated by real-time over-sampled PRN code generator 30A 
(FIG. 3 A), which includes prompt PRN code sign 29 A (FIG. 
3A), early-minus-late PRN code sign 35A (FIG. 3A), and 
early-minus-late PRN code zero mask 35B (FIG. 3A), is the 
low distortion of the generated codes versus the true codes. 
FIG. 9A, generated for prompt code comparisons, shows 
juxtaposed plots of the autocorrelation function 80 (FIG. 
9A) of the sampled true code and the cross correlation 
function 82 (FIG. 9A) between the sampled true code and 
the sampled code as generated from the new x p table. FIG. 
9B shows a similar comparison for early-minus-late PRN 
code 35 (FIG. 3A). In either case, the nominal chipping 
frequency is f c „ om =l/At c „ om =1.023 MHz, the sampling fre- 
quency is f J =l/A( s =12.199 MHz, and the code timing reso- 
lution of the table is m=12, i.e., V 12 "' of a sample. This 
resolution translates into approximately V'143”' of a code 
chip. The actual chipping rate differs from the nominal by 
(f c — f CMOm )= 1.5 Hz. This Doppler shift of the code chipping 
rate corresponds to a significant non-zero range rate between 
the receiver and the transmitter, 438 m/sec. The correlations 
are accumulated over 10230 code chips, and the code is a 
time-multiplexed version of a pair of the new GPS civilian 
L2 CL and CM codes. The data word indexing calculations 
for this example use a code chip start time resolution of 
A 4 /IO 5 , i.e., my=10 5 . 

It is obvious from FIGS. 9A and 9B that prompt PRN code 
29 (FIG. 3A) and early-minus-late PRN code 35 (FIG. 3A) 
distortion is very small. The correlations produced using the 
new tables are virtually identical to those produced from the 
exact code. The low distortion of the new approach is best 
characterized by two parameters: the amount by which the 
correlation peak of the prompt PRN code sign 29A (FIG. 


3A) droops below one and the offset of the zero-crossing 
time of the early-minus-late PRN code 35 (FIG. 3A). The 
former metric characterizes the power loss of the new 
approach, and the latter metric characterizes the net timing 
error. The droop of the prompt peak is only 0.3% of the 
nominal amplitude, which translates into a 0.03 dB loss. The 
timing distortion of the new code is less than 4xl0~ 4 code 
chips. This distortion is very small; it translates into about 
0.1 m of GPS range measurement error. 

It may seem paradoxical that the code timing error is only 
4xl0 -4 code chips when the code timing granularity of the 
x(i) tables is A4(mAt £ .)a7xlO~ 3 code chips. The resolution of 
this paradox lies in the averaging effect of the accumula- 
tions. The length of a code chip equals 143.099269 code 
offset time grid intervals for the example shown in FIGS. 9A 
and 9B. The non-integer nature of this number causes the 
code offset errors of the x(i) tables to get dithered as the 
accumulation works its way through successive data words. 
This dithering tends to average out the table granularity 
errors, and this averaging can reduce the net timing error by 
an order of magnitude or more, as shown in FIG. 9B. 

Although the invention has been described with respect to 
various embodiments, it should be realized this invention is 
also capable of a wide variety of further and other embodi- 
ments. 

What is claimed is: 

1. A software receiver comprising: 

a receiver capable of receiving a radio signal; 

means for digitizing the radio signal; and 

means for using bit-wise parallelism to provide a software 
correlator capable of mixing the digitized radio signal 
to form a baseband signal. 

2. The software receiver of claim 1 wherein said software 
correlator comprises: 

means for computing correlations between the baseband 
signal and at least one pseudo-random number (PRN) 
code using the bit-wise parallelism. 

3. The software receiver of claim 2 wherein said software 
correlator further comprises: 

means for computing accumulations from the correlations 
using the bit-wise parallelism. 

4. The software receiver of claim 3 further comprising: 

application-specific code capable of computing naviga- 
tion data using the accumulations. 

5. The software receiver of claim 1 wherein said means 
for digitizing comprises: 

means for down-converting the radio signal to an inter- 
mediate frequency; and 
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a digitizer capable of digitizing the intermediate fre- 
quency. 

6. The software receiver of claim 5 wherein said digitizer 
produces at least one bit/sample. 

7. The software receiver of claim 5 wherein said digitizer 
is an analog to digital converter. 

8. The software receiver of claim 1 wherein said bit-wise 
parallelism accommodates multi-bit signals. 

9. A method for processing at least one radio frequency 
(RF) signal over an accumulation period comprising the 
steps of: 

receiving at least one RF signal; 

mixing the at least one RF signal to form a baseband 
mixed signal using bit-wise parallelism; 
computing a fully mixed prompt integrand as a function 
of the baseband mixed signal and a pre-selected 
pseudo-random number (PRN) code using the bit-wise 
parallelism; 

computing a fully mixed early-minus-late integrand as a 
function of the baseband mixed signal and the pre- 
selected PRN code using the bit-wise parallelism; 
computing in-phase summed accumulations and quadra- 
ture summed accumulations over the accumulation 
period, the in-phase summed accumulations and 
quadrature summed accumulations being functions of 
the fully mixed prompt integrand and the fully mixed 
early-minus-late integrand; and 
rotating the in-phase summed accumulations and quadra- 
ture summed accumulations to correct for effects of 
frequency and phase granularity of the baseband mixed 
signal. 

10. A communications network comprising at least one 
node for carrying out the method according to claim 9. 

11. A computer data signal embodied in electromagnetic 
signals traveling over a computer network carrying infor- 
mation capable of causing a computer system in the network 
to practice the method of claim 9. 

12. A computer readable medium having instructions 
embodied therein for the practice of the method of claim 9. 

13. The method of claim 9 wherein said step of computing 
a fully mixed prompt integrand comprises the step of: 

generating the pre-selected pseudo-random number 
(PRN) code using the bit-wise parallelism, said step of 
generating the pre-selected PRN code comprising the 
steps of: 

formulating a tabulated function for use in translating 
code chip and timing values into PRN code using the 
bit-wise parallelism; 

generating at least one prompt PRN code in real-time; 
choosing at least one chip value from the at least one 
prompt PRN code, the at least one chip value cor- 
responding to at least one data interval that contains 
at least one sample of a data word, the at least one 
chip value having a known timing relative to the at 
least one data interval; 

transforming the known timing into a time grid index; 
and 

translating the at least one chip value and the time grid 
index during the at least one data interval into the 
PRN code using the bit-wise parallelism for the at 
least one data interval, said step of translating result- 
ing from the use of the tabulated function. 

14. The method of claim 13 further comprising the step of: 
computing the time grid index as a function of a time 

offset index k,, and an auxiliary table index /u v . 
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15. The method of claim 13 further comprising the step of: 
computing the time grid index iteratively as a function of 

a previously-computed time grid index, the at least one 
prompt PRN code, and the timing values associated 

5 with the at least one prompt PRN code. 

16. A software receiver comprising: 

a front-end device capable of receiving a radio signal, said 
front-end device capable of converting the radio signal 
into signal data; 

10 a data acquisition device capable of receiving the signal 
data, said data acquisition device capable of providing 
the signal data to a microprocessor; 
at least one shift register capable of packing the signal 
data into at least one data word; 

IS a baseband mixer capable of computing at least one 
baseband mixed signal as a function of the at least one 
data word by using bit-wise parallel processing; 
a correlator capable of computing correlations between 
the baseband mixed signal and a pseudo-random num- 

20 ber (PRN) code replica signal; and 

an accumulator capable of computing summed accumu- 
lations by accumulating the correlations. 

17. The software receiver of claim 16 wherein the signal 
data further comprises a signal sign. 

2: ' 18. The software receiver of claim 16 wherein the signal 

data further comprises a signal sign and at least one signal 
magnitude. 

19. The software receiver of claim 16 wherein said 
correlator uses the bit-wise parallel processing for comput- 

30 ing the correlations. 

20. The software receiver of claim 16 wherein said 
accumulator accumulates the correlations by an electronic 
mechanism. 

21. The software receiver of claim 16 wherein the radio 
signal can be received from a global positioning source. 

22. The software receiver of claim 16 wherein said 
correlator can be adapted to perform functions selected from 
a group consisting of accepting the radio signal at any 
frequency, accepting any PRN code, and accepting the radio 
signal from any device that generates a radio signal. 

23. The method of claim 9 wherein said step of computing 
in-phase and quadrature summed accumulations comprises 
the steps of: 

4 _ representing a carrier replica signal from at least one 
channel from a plurality of channels as a carrier replica 
sign and a carrier replica magnitude; 
representing the at least one RF signal from the at least 
one channel of the plurality of channels as at least one 

50 signal word; 

computing a baseband mixed sign as a function of the 
carrier replica sign and the at least one signal word; 
computing a baseband mixed magnitude as a function of 
the carrier replica magnitude; 

55 selecting a pseudo-random number (PRN) code having a 
prompt PRN code and an early-minus-late PRN code; 
representing the prompt PRN code as a prompt PRN code 
sign; 

computing a fully mixed prompt integrand sign as a 

60 function of the baseband mixed sign and the prompt 
PRN code sign; 

representing the early-minus-late PRN code as an early- 
minus-late PRN code sign and an early-minus-late PRN 
code zero mask; 

65 computing a fully mixed early-minus-late integrand sign 
as a function of the baseband mixed sign and the 
early-minus-late PRN code sign; 
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computing at least one set of prompt integrand value 
words as a function of the fully mixed prompt integrand 
sign and the baseband mixed magnitude; 
computing at least one set of early-minus-late integrand 
value words as a function of the fully mixed early- 5 
minus-late integrand sign, the baseband mixed magni- 
tude, and early-minus-late PRN code zero mask; 
computing prompt in-phase and quadrature summed 
accumulations for the plurality of channels for an 
accumulation interval as functions of the number of to 
significant bits in the at least one set of prompt inte- 
grand value words and as functions of the values 
associated with the at least one set of prompt integrand 
value words; and 

computing early-minus-late in-phase and quadrature is 
summed accumulations for the plurality of channels as 
functions of the number of significant bits in the at least 
one set of early-minus-late integrand value words and 
as functions of the values associated with the at least 
one set of early-minus-late integrand value words. 
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24 . The method of claim 23 further comprising the step of: 
selecting the significant bits from a group consisting of 

zeros and ones. 

25 . The method of claim 23 further comprising the step of: 
retrieving the carrier replica signal from a carrier replica 

table, the carrier replica table representing a coarse grid 
of frequencies. 

26 . The method of claim 23 further comprising the step of: 
representing the signal word from the at least one channel 

as a signal sign and a signal magnitude; and 
computing at least one baseband mixed magnitude as a 
function of the carrier replica magnitude and the signal 
magnitude. 

27 . The method of claim 26 further comprising the step of: 
retrieving the carrier replica signal from a carrier replica 

table, the carrier replica table representing a coarse grid 
of frequencies. 



